• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

计算机工程与科学 ›› 2026, Vol. 48 ›› Issue (4): 699-708.

• 图形与图像 • 上一篇    下一篇

基于自适应特征融合的低质量钢印字符检测和识别

吕淑静,娄鹏杰,彭世全,赵春龙,刘运丹,吕岳   

  1. (1.华东师范大学上海市多维度信息处理重点实验室,上海 200241;
    2.齐齐哈尔四达铁路设备有限责任公司,黑龙江 齐齐哈尔 161000)

  • 收稿日期:2024-06-21 修回日期:2024-09-13 出版日期:2026-04-25 发布日期:2026-04-30

Low-quality steel stamp character detection and recognition based on adaptive feature fusion

Lv Shujing,LOU Pengjie,PENG Shiquan,ZHAO Chunlong,LIU Yundan,Lv Yue   

  1. (1.Shanghai Key Laboratory of Multidimensional Information Processing,East China Normal University,Shanghai 200241;
    2.Qiqihar Sida Railway Equipment Co.,Ltd.,Qiqihar 161000,China)
  • Received:2024-06-21 Revised:2024-09-13 Online:2026-04-25 Published:2026-04-30

摘要: 针对金属制品钢印所面临的字符倾斜、模糊、字体不统一及铁锈污渍干扰等问题,提出一种基于自适应特征融合的字符检测模型YOLO-CHAR,采用MobileNet特征提取网络动态调整通道特征权重,增强模型对关键特征的捕捉能力,在特征融合层采用GFPN网络结构和SimAM注意力机制,灵活捕捉多尺度特征并加强特征融合能力;基于该字符检测模型,设计并实现了一套低质量火车轮轴钢印字符检测识别系统,该系统已投入使用,轮轴的整体识别日均准确率达到92%以上,满足现场使用要求。


关键词: 文本检测, 文本识别, 钢印字符, 自适应特征融合, 注意力机制

Abstract: To address the challenges faced by stamp character detection on metal products, such as character tilt, blurriness, inconsistent fonts, and interference from rust stains, a character detection model based on adaptive feature fusion, named YOLO-CHAR, is proposed. This model employs the MobileNet feature extraction network to dynamically adjust the weights of channel features, enhancing the model’s ability to capture key features. At the feature fusion layer, it utilizes the generalized feature pyramid network(GFPN) structure and the simplified attention module(SimAM) attention mechanism to flexibly capture multi-scale features and strengthen feature fusion capabilities. Based on this character detection model, a low-quality train wheelsets stamp character detection and recognition system is designed and implemented. This system has been put into use, achieving an overall daily average recognition accuracy of over 92% for wheelsets, which meets the on-site operational requirements.

Key words: text detection, text recognition, steel stamp character, adaptive feature fusion, attention mechanism