• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science ›› 2024, Vol. 46 ›› Issue (02): 253-263.

• Graphics and Images • Previous Articles     Next Articles

A multi-branch fine-grained recognition method based on dynamic localization and feature fusion

YANG Xiao-qiang,HUANG Jia-cheng   

  1. (College  of Computer Science & Technology,Xi’an University of Science and Technology,Xi’an 710000,China)
  • Received:2022-12-05 Revised:2023-02-26 Accepted:2024-02-25 Online:2024-02-25 Published:2024-02-24

Abstract: To solve the classification difficulties of small inter-class differences and large intra-class differences in fine-grained classification, an improved end-to-end fine-grained classification model (TBformer) is proposed based on Swin Transformer. In view of the interference of complex background on network recognition, the dynamic location module (DLModule) combining ECA, Resnet50 and SCDA is used to capture key objects, and a three-branch feature extraction module based on DLModule is designed to improve the ability of target discriminant feature extraction. In order to fully tap the rich fine-grained information contained in the three-branch features, a feature fusion method based on ECA is proposed to enhance the comprehensiveness and accuracy of the features, and improve the robustness of the network for fine-grained classification. The experimental results show that compared with the basic method, the accuracy of TBformer is improved by 3.19% in CUB-200-2011, 3.47% in Stanford Dogs and 1.09% in NABirds.

Key words: fine grained recognition, feature fusion, attention mechanism, multiple branches