iSFF-DBNet: An improved text detection algorithm in e-commerce images

Computer Engineering & Science ›› 2023, Vol. 45 ›› Issue (11): 2008-2017.

• Graphics and Images •

LI Zhuo-xuan, ZHOU Ya-tong

(School of Electronic and Information Engineering, Hebei University of Technology, Tianjin 300401, China)

Received: 2022-09-02 Revised: 2022-12-14 Accepted: 2023-11-25 Online: 2023-11-25 Published: 2023-11-16

Abstract

To address the problem that existing text detection models cannot accurately locate text in e-commerce images because of complex backgrounds and variable text-region shapes, an improved text detection model named Iterative Self-selective Feature Fusion DBNet (iSFF-DBNet) is proposed. First, after features are extracted by the backbone network, an attention mechanism is introduced into the construction of the Feature Pyramid Network (FPN), and an Iterative Self-selective Feature Fusion (iSFF) module is proposed to strengthen the model's feature extraction ability. Finally, a bilinear upsampling module is introduced to improve the adaptive performance of the differentiable binarization module. Experimental results on the text detection task of the ICPR MTWI 2018 web-scale image dataset show that, compared with the standard DBNet model, the improved model raises recall by 6.0% and F-score by 2.4%. Compared with other text detection models, it achieves a balance between precision and recall and detects text more accurately.
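
For readers unfamiliar with the two quantities the abstract relies on, the following is a minimal sketch, not the authors' implementation. It assumes the differentiable binarization step takes the form used in the original DBNet, B = 1 / (1 + exp(-k(P - T))) with amplification factor k = 50, and that the F-score is the usual harmonic mean of precision and recall. The function names and the toy input values are hypothetical; the iSFF module and the bilinear upsampling change are not reproduced here.

```python
import numpy as np


def differentiable_binarization(prob_map, thresh_map, k=50.0):
    """Soft binarization as in the original DBNet (assumed form):
    B = 1 / (1 + exp(-k * (P - T))).
    A large k makes the sigmoid approximate a hard step at P = T."""
    return 1.0 / (1.0 + np.exp(-k * (prob_map - thresh_map)))


def f_score(precision, recall):
    """Harmonic mean of precision and recall (F1)."""
    if precision + recall == 0:
        return 0.0
    return 2.0 * precision * recall / (precision + recall)


# Toy 2x2 maps with made-up values (not taken from the paper).
prob = np.array([[0.9, 0.2], [0.6, 0.4]])    # per-pixel text probability P
thresh = np.array([[0.3, 0.3], [0.5, 0.5]])  # per-pixel learned threshold T

approx_binary = differentiable_binarization(prob, thresh)
text_mask = approx_binary > 0.5  # pixels where P exceeds T

# Hypothetical precision/recall pair, for illustration only.
example_f = f_score(0.8, 0.6)
```

Because the step function is replaced by a steep sigmoid, the threshold map can be trained jointly with the probability map, which is why changes to how those maps are upsampled can affect the binarization result.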

Key words:

character detection, multi-scale feature, feature fusion, deep learning

LI Zhuo-xuan, ZHOU Ya-tong. iSFF-DBNet: An improved text detection algorithm in e-commerce images[J]. Computer Engineering & Science, 2023, 45(11): 2008-2017.

