A lightweight semantic segmentation algorithm based on ENet

Computer Engineering & Science, 2021, Vol. 43, Issue (08): 1454-1460.

XU Shi-jie (徐世杰), DU Yu (杜煜), LU Xin (鹿鑫), WU Si-fan (吴思凡)

(Smart City College, Beijing Union University, Beijing 100101, China)

Received: 2020-05-15  Revised: 2020-08-24  Accepted: 2021-08-25  Online: 2021-08-25  Published: 2021-08-24
Funding: National Natural Science Foundation of China (91420202)

Abstract

Abstract: Semantic segmentation algorithms classify images at the pixel level and are widely used in autonomous driving, medical image processing, and industrial automation, which gives them considerable research value. Research on semantic segmentation focuses on three goals: improving segmentation accuracy, reducing the number of parameters, and increasing inference speed. The classic lightweight semantic segmentation algorithm ENet uses a multi-layer convolutional encoder-decoder and a large number of dilated convolutions to avoid excessive downsampling and to exploit spatial information. Although this preserves a degree of spatial information integrity and a large receptive field, it suffers from a bloated encoder-decoder, poor transmission of spatial information, and a receptive field that overflows and causes the gridding effect. To address these problems, this paper prunes the ENet structure and uses an attention mechanism and pyramid-structured dilated convolutions to design a spatial information transmission module, which streamlines the network structure, improves the receptive field, and transmits spatial information completely. The result is an improved ENet algorithm, C-ENet+AM+RAM. Experimental results on the public datasets Cityscapes and BDD100K show that the new module improves the performance of the original model with fewer parameters and less computation, which demonstrates both the redundancy of the parts removed from the original algorithm and the effectiveness of the designed module.

Key words: semantic segmentation, lightweight, real-time, attention mechanism, receptive field, dilated convolution
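
The gridding effect named in the abstract can be illustrated with a small one-dimensional sketch. It assumes stride-1 convolutions with an odd kernel size, and the dilation rates are illustrative values that are not taken from the paper. The function names are hypothetical, and the code does not reproduce the C-ENet+AM+RAM module. It only shows why stacking equal dilation rates leaves holes in the receptive field, while a pyramid of increasing rates can cover it densely.

```python
def receptive_field(kernel_size, dilations):
    """Receptive field along one axis of stacked stride-1 dilated convolutions."""
    rf = 1
    for d in dilations:
        # Each layer widens the field by (kernel_size - 1) * dilation pixels.
        rf += (kernel_size - 1) * d
    return rf


def touched_offsets(kernel_size, dilations):
    """Input offsets, relative to one output pixel, that can influence it.

    Assumes an odd kernel size so the taps are symmetric around the centre.
    """
    half = kernel_size // 2
    offsets = {0}
    for d in dilations:
        taps = [k * d for k in range(-half, half + 1)]
        # Every offset reachable so far is shifted by every tap of this layer.
        offsets = {o + t for o in offsets for t in taps}
    return sorted(offsets)


def coverage(kernel_size, dilations):
    """Fraction of the receptive field that is actually sampled."""
    sampled = len(touched_offsets(kernel_size, dilations))
    return sampled / receptive_field(kernel_size, dilations)


if __name__ == "__main__":
    for rates in ([2, 2, 2], [1, 2, 4]):
        print(rates, receptive_field(3, rates), touched_offsets(3, rates))
```

With a kernel size of 3, the constant rates [2, 2, 2] span 13 pixels but sample only the 7 even offsets, which is the grid pattern. The increasing rates [1, 2, 4] span 15 pixels and sample all of them.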

XU Shi-jie, DU Yu, LU Xin, WU Si-fan. A lightweight semantic segmentation algorithm based on ENet[J]. Computer Engineering & Science, 2021, 43(08): 1454-1460.

