Image semantic segmentation based on feature fusion and attention mechanism

Computer Engineering & Science ›› 2023, Vol. 45 ›› Issue (03): 495-503.

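MA Dong-mei, HUANG Xin-yue, LI Yu

(School of Physics and Electronic Engineering, Northwest Normal University, Lanzhou 730070, China)

Online: 2023-03-25 Published: 2023-03-23
Funding: National Natural Science Foundation of China (61961037)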

Abstract

Abstract: Current high-accuracy semantic segmentation models require large amounts of computing resources, which makes them difficult to deploy on embedded platforms with limited storage and computing power. To address this, an image semantic segmentation model based on feature fusion and an attention mechanism is proposed. First, a DeepLabV3+-based model is optimized by applying channel pruning to lighten its MobileNetV2 backbone. Second, a Splittable Triplet Attention (STA) module is introduced into the lightweight model to strengthen the correlation among the internal dimensions of the feature maps. Finally, a fine-grained upsampling module is added to the decoder to refine edge details. In experiments on the PASCAL VOC 2012 and Cityscapes datasets, the proposed model has only 4.15×10^6 parameters and a computational cost of 10.23 GFLOPs, and achieves a mean intersection over union (mIoU) of 70.98% and 72.26%, respectively. The results show that the model strikes a good balance among computing resources, memory footprint, and accuracy.
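The channel-pruning step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not state which pruning criterion the authors use, so the example below assumes a common one: rank each output channel by the L1 norm of its weights and keep the largest. The function names, the `keep_ratio` parameter, and the flattened-list representation of filters are all hypothetical and are not taken from the paper.

```python
from typing import List, Sequence


def l1_norms(filters: Sequence[Sequence[float]]) -> List[float]:
    """Return the L1 norm of each output channel's flattened weights."""
    return [sum(abs(w) for w in f) for f in filters]


def channels_to_keep(filters: Sequence[Sequence[float]],
                     keep_ratio: float) -> List[int]:
    """Pick the channels with the largest L1 norms.

    Assumption: L1-norm ranking is used as the importance score;
    the paper's actual criterion is not given in the abstract.
    The returned indices are sorted in their original order.
    """
    if not 0.0 < keep_ratio <= 1.0:
        raise ValueError("keep_ratio must be in (0, 1]")
    norms = l1_norms(filters)
    # Number of channels to keep, rounded down, but never fewer than one.
    n_keep = max(1, int(len(filters) * keep_ratio))
    ranked = sorted(range(len(filters)), key=lambda i: norms[i], reverse=True)
    return sorted(ranked[:n_keep])


def prune(filters: Sequence[Sequence[float]],
          keep_ratio: float) -> List[List[float]]:
    """Return a copy of the filter bank with low-importance channels removed."""
    return [list(filters[i]) for i in channels_to_keep(filters, keep_ratio)]


# Toy filter bank: four output channels, two weights each (made-up values).
example_filters = [[0.1, -0.2], [1.0, 2.0], [0.0, 0.05], [-0.7, 0.6]]
pruned = prune(example_filters, keep_ratio=0.5)
```

In a real network, removing an output channel from one layer also requires removing the matching input channel from the next layer, and the pruned model is usually fine-tuned afterwards; the sketch leaves both steps out.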

Key words: image processing, semantic segmentation, DeepLabV3+, channel pruning, splittable triplet attention, fine-grained upsampling
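For readers unfamiliar with the accuracy metric reported above, the following sketch shows how mean intersection over union (mIoU) is commonly computed from a confusion matrix. It is a generic illustration and not the authors' evaluation code; the function names are hypothetical, and the use of 255 as an ignored label is an assumption based on a common convention for segmentation label maps.

```python
from typing import List, Sequence


def confusion_matrix(pred: Sequence[int], target: Sequence[int],
                     num_classes: int, ignore_index: int = 255) -> List[List[int]]:
    """Count pixels by (true class, predicted class).

    Assumption: pixels whose true label equals ignore_index are skipped.
    """
    cm = [[0] * num_classes for _ in range(num_classes)]
    for p, t in zip(pred, target):
        if t == ignore_index:
            continue
        cm[t][p] += 1
    return cm


def mean_iou(cm: Sequence[Sequence[int]]) -> float:
    """Average the per-class IoU = TP / (TP + FP + FN).

    Classes that appear in neither the prediction nor the ground truth
    are left out of the average.
    """
    n = len(cm)
    ious = []
    for c in range(n):
        tp = cm[c][c]
        fp = sum(cm[r][c] for r in range(n)) - tp  # predicted c, truly another class
        fn = sum(cm[c]) - tp                       # truly c, predicted another class
        denom = tp + fp + fn
        if denom > 0:
            ious.append(tp / denom)
    return sum(ious) / len(ious) if ious else 0.0


# Toy example with three classes; the last pixel is ignored (label 255).
target = [0, 0, 1, 1, 2, 255]
pred = [0, 1, 1, 1, 2, 0]
cm = confusion_matrix(pred, target, num_classes=3)
score = mean_iou(cm)
```

In this toy example the per-class IoU values are 1/2, 2/3, and 1, so the mean is 13/18, or about 0.722.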

MA Dong-mei, HUANG Xin-yue, LI Yu. Image semantic segmentation based on feature fusion and attention mechanism[J]. Computer Engineering & Science, 2023, 45(03): 495-503.
