• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science ›› 2023, Vol. 45 ›› Issue (11): 2008-2017.

• Graphics and Images • Previous Articles     Next Articles

iSFF-DBNet:An improved text detection algorithm in e-commerce images

LI Zhuo-xuan,ZHOU Ya-tong   

  1. (School of Electronic and Information Engineering,Hebei University of Technology,Tianjin 300401,China)
  • Received:2022-09-02 Revised:2022-12-14 Accepted:2023-11-25 Online:2023-11-25 Published:2023-11-16

Abstract: Aiming at the problem that existing text detection models cannot accurately detect text locations due to complex backgrounds and variable text region shapes in e-commerce images, an improved text detection model, named Iterative Self-selective Feature Fusion DBNet (iSFF-DBNet), is proposed. Firstly, after extracting features from the backbone network, an attention mechanism is introduced in the process of building a Feature Pyramid Network (FPN), and an Iterative Self-selective Feature Fusion (iSFF) module is proposed to enhance the feature extraction ability of the model. Finally, a bilinear upsampling module is introduced to improve the adaptive performance of the differentiable binaryization module. Experimental results show that compared to the standard DBNet model, the recall and F-score of the improved model are increased by 6.0% and 2.4%, respectively, in the text detection task of the ICPR MTWI 2018 web-scale image dataset. Compared with other text detection models, this model achieves a balance between accuracy and recall, and can detect text more accurately.

Key words: