A multi-branch fine-grained recognition method based on dynamic localization and feature fusion

Computer Engineering & Science ›› 2024, Vol. 46 ›› Issue (02): 253-263.

• Graphics and Images •

YANG Xiao-qiang, HUANG Jia-cheng

(College of Computer Science & Technology, Xi’an University of Science and Technology, Xi’an 710000, China)

Received: 2022-12-05 Revised: 2023-02-26 Accepted: 2024-02-25 Online: 2024-02-25 Published: 2024-02-24

Abstract

Fine-grained classification is difficult because inter-class differences are small and intra-class differences are large. To address this, an improved end-to-end fine-grained classification model, TBformer, is proposed based on Swin Transformer. To reduce the interference of complex backgrounds with recognition, a dynamic localization module (DLModule) combining ECA, ResNet50, and SCDA is used to capture key objects, and a three-branch feature extraction module built on DLModule is designed to improve the extraction of discriminative target features. To fully exploit the rich fine-grained information in the three-branch features, an ECA-based feature fusion method is proposed that makes the features more comprehensive and accurate and improves the robustness of the network for fine-grained classification. Experimental results show that, compared with the baseline method, TBformer improves accuracy by 3.19% on CUB-200-2011, 3.47% on Stanford Dogs, and 1.09% on NABirds.
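
The abstract does not spell out how the ECA-based fusion is computed, so the following is only a minimal sketch of one plausible reading: the three branch feature vectors are concatenated, and each channel is re-weighted by an efficient-channel-attention-style gate (a 1D convolution over neighbouring channels followed by a sigmoid). The function names, the fixed odd-length kernel, and the sample feature values are all hypothetical and are not taken from the paper; in a trained network the kernel weights would be learned.

```python
import math


def eca_weights(descriptor, kernel):
    """ECA-style gate: 1D convolution across neighbouring channels, then sigmoid.

    `descriptor` holds one value per channel (assumed to be already pooled).
    `kernel` is assumed to have odd length; zero padding keeps the output length.
    """
    pad = len(kernel) // 2
    padded = [0.0] * pad + list(descriptor) + [0.0] * pad
    weights = []
    for i in range(len(descriptor)):
        s = sum(k * padded[i + j] for j, k in enumerate(kernel))
        weights.append(1.0 / (1.0 + math.exp(-s)))
    return weights


def fuse_branches(branches, kernel):
    """Concatenate branch feature vectors and scale each channel by its gate."""
    fused = [v for branch in branches for v in branch]
    weights = eca_weights(fused, kernel)
    return [w * v for w, v in zip(weights, fused)]


# Hypothetical 4-channel features from three branches (illustrative values only).
raw_branch = [0.2, 1.5, -0.3, 0.8]
object_branch = [1.1, 0.4, 0.9, -0.2]
part_branch = [0.7, -0.6, 1.3, 0.5]
kernel = [0.25, 0.5, 0.25]  # stands in for learned weights

fused = fuse_branches([raw_branch, object_branch, part_branch], kernel)
print(len(fused))
```

The fused vector keeps one entry per input channel (12 here), so a classifier head could consume it directly; the gate only rescales channels and never mixes their values.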

Key words: fine-grained recognition, feature fusion, attention mechanism, multiple branches

YANG Xiao-qiang, HUANG Jia-cheng. A multi-branch fine-grained recognition method based on dynamic localization and feature fusion[J]. Computer Engineering & Science, 2024, 46(02): 253-263.

