Computer Engineering & Science ›› 2024, Vol. 46 ›› Issue (02): 253-263.
• Graphics and Images • Previous Articles Next Articles
YANG Xiao-qiang,HUANG Jia-cheng
Received:
2022-12-05
Revised:
2023-02-26
Accepted:
2024-02-25
Online:
2024-01-25
Published:
2024-02-24
YANG Xiao-qiang, HUANG Jia-cheng. A multi-branch fine-grained recognition method based on dynamic localization and feature fusion[J]. Computer Engineering & Science, 2024, 46(02): 253-263.
[1] | Zhang N,Donahue J,Girshick R,et al.Part-based R-CNNs for fine-grained category detection[C]∥Proc of the 13th European Conference on Computer Vision,2014: 834-849. |
[2] | Branson S,Horn G V,Belongie S,et al.Bird species categorization using pose normalized deep convolutional nets[J].arXiv:1406.2952,2014. |
[3] | Huang S, Xu Z,Tao D,et al.Part-stacked CNN for fine-grained visual categorization[C]∥Proc of the IEEE Confe- rence on Computer Vision and Pattern Recognition,2016: 1173-1182. |
[4] | Lin D,Shen X,Lu C,et al.Deep LAC: Deep localization,alignment and classification for fine-grained recognition[C]∥Proc of the IEEE Conference on Computer Vision and Pattern Recognition,2015: 1666-1674. |
[5] | Wei X S,Xie C W,Wu J.Mask-CNN: Localizing parts and selecting descriptors for fine-grained image recognition[J].Pattern Recognition,2018,76: 704-714. |
[6] | Xiao T,Xu Y,Yang K,et al.The application of two-level attention models in deep convolutional neural network for fine-grained image classification[C]∥Proc of the IEEE Conference on Computer Vision and Pattern Recognition,2015: 842-850. |
[7] | Liu X,Xia T,Wang J,et al.Fully convolutional attention networks for fine-grained recognition[J]. arXiv:1603.06765,2016. |
[8] | Zhao B,Wu X,Feng J S,et al.Diversified visual attention networks for fine-grained object classification[J].IEEE Transactions on Multimedia,2017,19(6): 1245-1256. |
[9] | Fu J,Zheng H,Tao M.Look closer to see better: Recurrent attention convolutional neural network for fine-grained image recognition[C]∥Proc of the IEEE Conference on Computer Vision & Pattern Recognition,2017: 4438-4446. |
[10] | Zheng H,Fu J,Tao M,et al.Learning multi-attention con- volutional neural network for fine-grained image recognition[C]∥Proc of the IEEE International Conference on Computer Vision,2017: 5209-5217. |
[11] | Yang Z,Luo T,Wang D,et al.Learning to navigate for fine-grained classification[C]∥Proc of the European Conference on Computer Vision,2018: 420-435. |
[12] | Zhuang P,Wang Y,Qiao Y.Learning attentive pairwise interaction for fine-grained classification[C]∥Proc of the AAAI Conference on Artificial Intelligence,2020:13130-13137. |
[13] | Gao Y,Han X T,Wang X,et al.Channel interaction networks for fine-grained image categorization[C]∥Proc of the AAAI Conference on Artificial Intelligence,2020:10818-10825. |
[14] | He J,Chen J,Liu S,et al.TransFG: A transformer architecture for fine-grained recognition[C]∥Proc of the AAAI Conference on Artificial Intelligence,2022:852-860. |
[15] | Zhang Y,Cao J,Zhang L,et al.A free lunch from VIT: Adaptive attention multi-scale fusion transformer for fine-grained visual recognition[C]∥Proc of IEEE International Conference on Acoustics,Speech and Signal Processing,2022: 3234-3238. |
[16] | Liu X,Wang L,Han X.Transformer with peak suppression and knowledge guidance for fine-grained image recognition[J].Neurocomputing,2022,492: 137-149. |
[17] | Conde M V,Turgutlu K.Exploring vision transformers for fine-grained classification[J].arXiv:2106.10587,2021. |
[18] | Wang J, Yu X,Gao Y.Feature fusion vision transformer fine-grained visual categorization[J].arXiv:2107.02341,2021. |
[19] | Liu Z,Lin Y,Cao Y,et al.Swin transformer: Hierarchical vision transformer using shifted windows[C]∥Proc of the IEEE/CVF International Conference on Computer Vision,2021: 10012-10022. |
[20] | Wang Q,Wu B,Zhu P,et al.ECA-Net: Efficient channel attention for deep convolutional neural networks[C]∥Proc of the IEEE/CVF Conference on Computer Vision and Pattern Recognition,2020: 11534-11542. |
[21] | He K,Zhang X,Ren S,et al.Deep residual learning for image recognition[C]∥Proc of the IEEE Conference on Computer Vision and Pattern Recognition,2016: 770-778. |
[22] | Wei X S, Luo J H, Wu J,et al.Selective convolutional descriptor aggregation for fine-grained image retrieval[J].IEEE Transactions on Image Processing,2017,26(6): 2868-2881. |
[23] | Simonyan K,Zisserman A.Very deep convolutional networks for large-scale image recognition[J]. arXiv:1409.1556,2014. |
[24] | Wah C, Branson S,Welinder P.The Caltech-UCSD birds-200-2011 dataset[EB/OL].[2020-12-28].http:∥www.Vision.calt-ech.edu/visipedia/CUB-200-2011.html. |
[25] | Khosla A,Jayadevaprakash N,Yao B,et al.Novel dataset for fine-grained image categorization:Stanford dogs[C]∥Proc of the IEEE International Conference on Computer Vision,2011: 167-169. |
[26] | Van Horn G, Branson S,Farrell R,et al.Building a bird re- cognition app and large scale dataset with citizen scientists: The fine print in fine-grained dataset collection[C]∥Proc of the IEEE Conference on Computer Vision and Pattern Recognition,2015: 595-604. |
[27] | Dubey A,Gupta O,Guo P,et al.Pairwise confusion for fine-grained visual classification[C]∥Proc of the European Conference on Computer Vision,2018: 70-86. |
[28] | Zheng H, Fu J, Zha Z J,et al.Looking for the devil in the details: Learning trilinear attention sampling network for fine-grained image recognition [C]∥Proc of the IEEE/CVF Conference on Computer Vision and Pattern Recognition,2019 : 5012-5021. |
[29] | 李宽宽,刘立波.双线性聚合残差注意力的细粒度图像分类模型[J].计算机科学与探索,2022,16(4):938-949. |
Li Kuan-kuan,Liu Li-bo.Fine-grained image classification model based on bilinear aggregate residual attention[J].Journal of Frontiers of Computer Science and Technology,2022,16(4): 938-949. | |
[30] | Ji R,Wen L,Zhang L,et al.Attention convolutional binary neural tree for fine-grained visual categorization[C]∥Proc of the IEEE/CVF Conference on Computer Vision and Pattern Recognition,2020: 10468-10477. |
[31] | Liang Y Z,Zhu L C,Wang X H,et al.Penalizing the hard example but not too much: A strong baseline for fine-grained visual classification[J].IEEE Transactions on Neural Networks and Learning Systems,2022:1-12.doi:10.1109/TNNLS.2022.3213563. |
[32] | Wang Z,Wang S,Li H,et al.Graph-propagation based correlation learning for weakly supervised fine-grained image classification[C]∥Proc of the AAAI Conference on Artificial Intelligence,2020:12289-12296. |
[33] | Xu S, Chang D, Xie J,et al.Grad-CAM guided channel- spatial atten-tion module for fine-grained visual classification[C]∥Proc of 2021 IEEE 31st International Workshop on Machine Learning for Signal Processing,2021: 1-6. |
[34] | Rong Y, Xu W,Akata Z,et al.Human attention in fine-grained classification[J]. arXiv:2111.01628,2021. |
[35] | Zhang T,Chang D,Ma Z,et al.Progressive co-attention network for fine-grained visual classification[C]∥Proc of 2021 International Conference on Visual Communications and Image Processing,2021: 1-5. |
[36] | Zhang S,Du R,Chang D,et al.Knowledge transfer based fine-grained visual classification[C]∥Proc of 2021 IEEE International Conference on Multimedia and Expo,2021: 1-6. |
[37] | Guo P,Farrell R.Aligned to the object,not to the image: A unified pose-aligned representation for fine-grained recognition[C]∥Proc of 2019 IEEE Winter Conference on Applications of Computer Vision,2019: 1876-1885. |
[38] | 丁文谦,余鹏飞,李海燕,等.基于Xception网络的弱监督细粒度图像分类[J].计算机工程与应用,2022,58(2):235-243. |
Ding Wen-qian,Yu Peng-fei,Li Hai-yan,et al.Weakly supervised fine-grained image classification based on Xception network[J].Computer Engineering and Applications,2022,58(2):235-243. | |
[39] | Zhao Y,Yan K,Huang F,et al.Graph-based high-order relation discovery for fine-grained recognition[C]∥Proc of the IEEE/CVF Conference on Computer Vision and Pattern Recognition,2021: 15079-15088. |
[40] | Du R,Chang D,Bhunia A K,et al.Fine-grained visual classification via progressive multi-granularity training of Jigsaw patches[C]∥Proc of European Conference on Computer Vision,2020: 153-168. |
[41] | Hu Y,Liu X,Zhang B,et al.Alignment enhancement network for fine-grained visual categorization[J].ACM Transactions on Multimedia Computing,Communications,and Applications,2021,17(1s):Article No.:12. |
[42] | Zhao Y,Li J,Chen X,et al.Part-guided relational transformers for fine-grained visual recognition[J].IEEE Trans- actions on Image Processing,2021,30: 9470-9481. |
[43] | Li H,Zhang X,Tian Q,et al.Attribute mix: Semantic data augmentation for fine grained recognition[C]∥Proc of 2020 IEEE International Conference on Visual Communications and Image Processing,2020: 243-246. |
[44] | Ge W F, Lin X R,Yu Y Z.Weakly supervised complementary parts models for fine-grained image classification from the bottom up[C]∥Proc of the IEEE/CVF Conference on Computer Vision and Pattern Recognition,2019: 3034-3043. |
[45] | Lin T Y, RoyChowdhury A,Maji S.Bilinear CNN models for fine-grained visual recognition[C]∥Proc of the IEEE International Conference on Computer Vision,2015: 1449-1457. |
[46] | Dubey A,Gupta O,Raskar R,et al.Maximum entropy fine-grained classification[C]∥Proc of the 32nd International Conference on Neural Information Processing Systems,2018: 635-645. |
[47] | Liu X, Xia T,Wang J,et al.Fully convolutional attention networks for gine-grained recognition[J]. arXiv:1603.06765,2021. |
[48] | Li S, Zang Z, Li S Z. Exploring localization for self- supervised fine-grained contrastive learning[J]. arXiv:2106.15788,2016. |
[49] | Sun M,Yuan Y,Zhou F,et al.Multi-attention multi-class constraint for fine-grained image recognition[C]∥Proc of the European Conference on Computer Vision,2018: 805-821. |
[50] | Woo S,Park J,Lee J Y,et al.CBAM: Convolutional block attention module[C]∥Proc of the European Conference on Computer Vision,2018: 3-19. |
[51] | Cao Y,Xu J,Lin S,et al.GCNet: Non-local networks meet squeeze-excitation networks and beyond[C]∥Proc of 2019 IEEE/CVF International Conference on Computer Vision Workshop,2019: 1971-1980. |
[52] | Hu J,Shen L,Sun G.Squeeze-and-excitation networks[C]∥Proc of the IEEE Conference on Computer Vision and Pattern Recognition,2018: 7132-7141. |
[53] | Rother C,Kolmogorov V,Blake A.Interactive foreground extraction using iterated graph cuts[J].ACM Transactions on Graphics,2004,23(3):309-314. |
[1] | HUANG Zhen-wei, CHEN Wei, WANG Wen-jie, LU Jin-tong. Underwater vehicle target detection and experiment based on improved RetinaNet network [J]. Computer Engineering & Science, 2024, 46(02): 264-271. |
[2] | ZHANG Huan, LI Wei-jiang, . Distant supervision relation extraction based on type attention and GCN [J]. Computer Engineering & Science, 2024, 46(02): 316-324. |
[3] | JIANG Zhi-peng, WANG Zi-quan, ZHANG Yong-sheng, YU Ying, CHENG Bin-bin, ZHAO Long-hai, ZHANG Meng-wei. A vehicle object detection algorithm in UAV video stream based on improved Deformable DETR [J]. Computer Engineering & Science, 2024, 46(01): 91-101. |
[4] | WANG Shan-shan, WANG Meng-zhu, LUO Zhi-gang. A focally discriminative loss for unsupervised domain adaptation method [J]. Computer Engineering & Science, 2024, 46(01): 132-141. |
[5] | LIANG Xiu-man, ZHOU Jia-run, YANG Ruo-lan. LPD-YOLO:Lightweight obscured pedestrian detection model [J]. Computer Engineering & Science, 2023, 45(12): 2197-2205. |
[6] | JIAO Jia-hui, MA Si-yuan, SONG Yu, SONG Wei. Bi-modal music genre classification model MGTN based on convolutional attention mechanism [J]. Computer Engineering & Science, 2023, 45(12): 2226-2236. |
[7] | JIA Kang, LI Xiao-nan, LI Guan-yu. A graph similarity computation model based on adaptive structure aware pooling graph matching [J]. Computer Engineering & Science, 2023, 45(11): 1999-2007. |
[8] | LI Zhuo-xuan, ZHOU Ya-tong. iSFF-DBNet:An improved text detection algorithm in e-commerce images [J]. Computer Engineering & Science, 2023, 45(11): 2008-2017. |
[9] | DONG Zi-ping, CHEN Shi-guo, LIAO Guo-qing. A dense multi-face detection algorithm based on YOLOv5s [J]. Computer Engineering & Science, 2023, 45(10): 1838-1846. |
[10] | ZHANG Qian-kun, HAN Hu, HAO Jun. Aspect-level sentiment classification based on dual attention fusion knowledge [J]. Computer Engineering & Science, 2023, 45(10): 1866-1873. |
[11] | LI Bao-min, WANG Xiao-peng, SUN Qian-rong, ZHANG Jun-ping. A forest fire image early warning detection method based on probabilistic two-stage CenterNet2 [J]. Computer Engineering & Science, 2023, 45(10): 1884-1890. |
[12] | CHEN Hai-yong, L Cheng-jie, DU Chun, CHEN Peng. A Siamese attention-gated fusion encoding-decoding network for remote sensing image change detection [J]. Computer Engineering & Science, 2023, 45(09): 1593-1601. |
[13] | ZENG Fan-feng, WANG Chun-zhen, LI Chen. An unsupervised video summarization algorithm based on deep and shallow feature fusion [J]. Computer Engineering & Science, 2023, 45(09): 1602-1610. |
[14] | MA Si-yuan, JIAO Jia-hui, REN Sheng-qi, SONG Wei. Missing value filling for multi-variable urban air quality data based on attention mechanism [J]. Computer Engineering & Science, 2023, 45(08): 1354-1364. |
[15] | YIN Chun-yong, FENG Meng-xue. A semi-supervised log anomaly detection method based on attention mechanism [J]. Computer Engineering & Science, 2023, 45(08): 1405-1415. |
Viewed | ||||||
Full text |
|
|||||
Abstract |
|
|||||
湘公网安备 43010502000083号
湘ICP备10006030号
Copyright © Computer Engineering & Science, All Rights Reserved.
Address:109 Deya Rd,Changsha,hunan(410073) Tel: 0731-87002567 Email: jsjgcykx@vip.163.com
Powered by Beijing Magtech Co., Ltd.