• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

计算机工程与科学 ›› 2026, Vol. 48 ›› Issue (3): 488-499.

• 图形与图像 • 上一篇    下一篇

融合多尺度信息和特征映射关系的层次多粒度图像分类

滕尚志,梅长旺,游新冬,吕学强


  

  1. (北京信息科技大学网络文化与数字传播北京市重点实验室,北京 100192)



  • 收稿日期:2024-05-16 修回日期:2024-10-09 出版日期:2026-03-25 发布日期:2026-03-25
  • 基金资助:
    国家自然科学基金(62202061,62171043);北京市自然科学基金(4232025);北京市教委科研计划科技一般项目(KM202311232002)

Integrating multi-scale information and feature mapping relationships for hierarchical multi-granularity image classification

TENG Shangzhi,MEI Changwang,YOU Xindong,Lv Xueqiang   

  1. (Beijing Key Laboratory of Internet Culture & Digital Dissemination Research,
    Beijing Information Science & Technology University,Beijing 100192,China)
  • Received:2024-05-16 Revised:2024-10-09 Online:2026-03-25 Published:2026-03-25

摘要: 为挖掘细粒度图像在不同粒度下的细节纹理信息,关注层次特征之间的关系,提出了一种融合多尺度信息和特征映射关系的层次多粒度图像分类方法。首先,提取骨干网络的中级语义特征作为图像在不同类别粒度下的局部细节特征,并与对应类别粒度的高级语义特征进行融合。其次,使用特征映射算法表示类别层次之间的映射关系,对各层次的多粒度特征进行融合。最后,提出重排序分类损失RCL来提升各层次类别的分类准确度,利用类别中心三元组损失TCL在细粒度特征空间中将对象与其类中心的距离尽可能拉近,拉远与不同类中心的距离。在CUB-200-2011,FGVC-Aircraft和Stanford Cars这3个层次多粒度数据集上进行的评测结果表明,所提方法的细粒度图像分类准确度分别达到了88.8%,94.2%和95.1%,加权平均精确率wAP分别达到了90.4%,95.1%和95.1%,充分体现了所提方法在层次多粒度图像分类任务上的有效性和先进性。

关键词: 图像分类, 细粒度图像, 层次多粒度图像, 多尺度信息, 特征映射

Abstract: To explore the detailed texture information of fine-grained images at different granularity levels and to focus on the relationships between hierarchical features, a hierarchical multi-granularity image classification method that integrates multi-scale information and feature mapping relationships is proposed. Firstly, mid-level semantic features extracted from the backbone network are utilized as local detailed features of images at different category granularities and fused with corresponding high-level semantic features at the same granularities. Then, a feature mapping algorithm is employed to represent the mapping relationships between category hierarchies, enabling the fusion of multi-granularity features across different levels. Finally, a reordering classification loss (RCL) is introduced to enhance classification accuracy across hierarchical categories, while a triplet center loss (TCL) is utilized to minimize the distance between objects and their class centers in the fine-grained feature space and maximize the distance from different class centers. Evaluation results on 3 hierarchical multi-granularity datasets-CUB- 200-2011, FGVC-Aircraft, and Stanford Cars-demonstrate that the proposed method achieves fine-grained image classification performance of 88.8%, 94.2%, and 95.1%, respectively, with weighted average precision (wAP) values of 90.4%, 95.1%, and 95.1%. These results fully validate the effectiveness and advanced nature of the proposed method for hierarchical multi-granularity image classification tasks.

Key words: image classification, fine-grained image, hierarchical multi-granularity image, multi-scale information, feature mapping