A multi-path and multi-scale attention network for land cover segmentation

Computer Engineering & Science ›› 2026, Vol. 48 ›› Issue (1): 108-118.

• Graphics and Images •

LI Yan,FAN Xinyu,CHEN Qin

(School of Automation,Nanjing University of Information Science & Technology,Nanjing 210044,China)

Received:2024-05-08 Revised:2024-06-28 Online:2026-01-25 Published:2026-01-25

Abstract

In recent years, Transformers have made remarkable progress in image recognition, yet they still face challenges in pixel-level segmentation tasks, primarily because they do not handle local deviations explicitly or effectively enough. To address this issue, this paper proposes a multi-path and multi-scale attention network, named DMANet. By integrating the strengths of convolutional neural networks (CNNs) and Transformers in the encoding phase, the network simultaneously captures fine-grained local information and extensive global context from images, strengthening feature extraction. The proposed interactive dual-branch structure improves feature integration and, in turn, the model's performance in dense prediction tasks. In the decoding phase, cross-layer feature fusion enhances DMANet's ability to recognize complex objects. Experiments on the Potsdam, GID-15, and L8 SPARCS datasets demonstrate DMANet's strong performance and broad applicability in complex land cover segmentation tasks.

Key words: Transformer structure, semantic segmentation, multi-path and multi-scale, convolutional neural network, land cover

LI Yan, FAN Xinyu, CHEN Qin. A multi-path and multi-scale attention network for land cover segmentation[J]. Computer Engineering & Science, 2026, 48(1): 108-118.

