基于注意力机制的特征融合语义分割模型

doi:10.3969/j.issn.1007-130X.2026.05.013

计算机工程与科学 ›› 2026, Vol. 48 ›› Issue (5): 898-905.doi: 10.3969/j.issn.1007-130X.2026.05.013

基于注意力机制的特征融合语义分割模型

马冬梅，朱启荣，吕雪龙

（西北师范大学物理与电子工程学院，甘肃兰州 730070）

收稿日期:2024-06-27 修回日期:2025-01-06 出版日期:2026-05-25 发布日期:2026-05-21
基金资助:
国家自然科学基金（61961037）

A feature fusion semantic segmentation model based on attention mechanism

MA Dongmei,ZHU Qirong,Lv Xuelong

(College of Physics and Electronic Engineering,Northwest Normal University,Lanzhou 730070,China)

Received:2024-06-27 Revised:2025-01-06 Online:2026-05-25 Published:2026-05-21

摘要/Abstract

摘要： 针对现有的语义分割模型DeepLabV3+容易出现误分割、分割精度低以及细节信息丢失严重等问题，提出了一种基于注意力机制的融合语义分割模型。首先，在该模型中的空洞卷积分支级联一个可切换空洞卷积，使其更加灵活地适应不同尺度的特征，减少误分割现象；其次，引入RFEM模块，捕获浅层特征多尺度信息以及不同范围的依赖关系，提高模型的性能；再次，提取模型的中间层特征，并利用ELAFF模块与其深层特征融合，使模型恢复在下采样过程中丢失的细节信息；最后，添加高效局部注意力，使模型更加关注图像信息，减少背景干扰。在PASCAL VOC 2012数据集上的实验结果表明，相比原模型，所提模型的平均交并比提升2.36个百分点，平均像素准确度提升1.60个百分点，可有效改善模型的分割性能。

关键词: 注意力机制, 语义分割, DeepLabV3+, 特征融合

Abstract: To address the issues of mis-segmentation, low segmentation accuracy, and severe loss of detailed information commonly encountered in the existing DeepLabV3+ semantic segmentation model, a feature-fusion semantic segmentation model based on an attention mechanism is proposed. Firstly, a switchable atrous convolution is cascaded within the dilated convolution branch of the model, enabling it to adapt more flexibly to features at different scales and thereby reducing mis-segmentation. Additionally, an RFEM module is introduced to capture multi-scale information from shallow features and depen- dencies across different ranges, enhancing the model’s performance. Furthermore, intermediate-layer features of the model are extracted and fused with its deep features using the ELAFF module, enabling the model to recover detailed information lost during the downsampling process. Finally, an efficient local attention mechanism is added to make the model focus more on image information and reduce background interference. Experimental results on the PASCAL VOC 2012 dataset demonstrate that, compared to the original model, the proposed model achieves a 2.36 percentage points increase in mean intersection-over-union (mIoU) and a 1.60 percentage points improvement in mean pixel accuracy (MPA), effectively enhancing the model’s segmentation performance.

Key words: attention mechanism, semantic segmentation, DeepLabV3+, feature fusion

马冬梅, 朱启荣, 吕雪龙. 基于注意力机制的特征融合语义分割模型[J]. 计算机工程与科学, 2026, 48(5): 898-905.

MA Dongmei, ZHU Qirong, Lv Xuelong. A feature fusion semantic segmentation model based on attention mechanism[J]. Computer Engineering & Science, 2026, 48(5): 898-905.

[1]	耿焕同, 范子辰, 蒋骏, 刘振宇, 李嘉兴. 任务提示融合的端到端视觉多任务学习模型[J]. 计算机工程与科学, 2026, 48(3): 456-466.
[2]	杨梅, 刘司南, 潘臻, 高磊, 闵帆. 面向农业地块提取的边缘-语义协同双分支解码网络[J]. 计算机工程与科学, 2026, 48(3): 444-455.
[3]	张洋, 胡慧君, 刘茂福. 基于全景语义和多层次特征融合的方面级多模态情感分析[J]. 计算机工程与科学, 2026, 48(2): 341-352.
[4]	曹利, 徐慧英, 谢刚, 李毅, 黄晓, 陈昊, 朱信忠. ASOD-YOLO：基于YOLOv8n改进的航空小目标检测模型[J]. 计算机工程与科学, 2026, 48(1): 133-145.
[5]	李燕, 樊新宇, 陈芹. 用于土地覆盖分割的多路径多尺度注意力网络[J]. 计算机工程与科学, 2026, 48(1): 108-118.
[6]	张胜裕1, 2, 3, 宋慧慧2, 3, 4. 基于特征解耦的双流网络模型全色锐化[J]. 计算机工程与科学, 2025, 47(9): 1628-1637.
[7]	蒲小莉, 赖惠成, 高古学. BF-YOLO：基于YOLOv8改进的小目标检测算法[J]. 计算机工程与科学, 2025, 47(8): 1425-1436.
[8]	徐梦繁, 黄微, 古倬铭. 基于多级对抗均值教师网络的夜间城市景观语义分割[J]. 计算机工程与科学, 2025, 47(12): 2195-2203.
[9]	马冬梅, 王鹏宇, 郭智浩. 一种基于注意力机制的轻量级语义分割[J]. 计算机工程与科学, 2024, 46(8): 1503-1512.
[10]	徐欣, 李若诗, 袁野, 刘娜. 基于可学习图像滤波器的雾天驾驶场景图像语义分割[J]. 计算机工程与科学, 2024, 46(11): 2027-2034.
[11]	邱晓梦, 王琳, 谷文俊, 宋伟, 田浩来, 胡誉. 光流法修正的时序图像语义分割模型[J]. 计算机工程与科学, 2024, 46(1): 102-110.
[12]	厍向阳, 马亦骏. 改进的遥感图像语义分割算法[J]. 计算机工程与科学, 2023, 45(3): 504-511.
[13]	马冬梅, 黄欣悦, 李煜. 基于特征融合和注意力机制的图像语义分割[J]. 计算机工程与科学, 2023, 45(3): 495-503.
[14]	刘榕, 伍欣, 敖斌, 文青, 李宽. 用于CD56图像分割的细胞标注精细化与自适应加权损失[J]. 计算机工程与科学, 2022, 44(5): 870-878.
[15]	刘李漫, 谭龙雨, 彭源, 刘佳. 基于全融合网络的三维点云语义分割[J]. 计算机工程与科学, 2022, 44(5): 862-869.

基于注意力机制的特征融合语义分割模型

A feature fusion semantic segmentation model based on attention mechanism

PDF

可视化

摘要/Abstract

引用本文

使用本文

相关文章 15

编辑推荐

Metrics

本文评价