• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

计算机工程与科学 ›› 2026, Vol. 48 ›› Issue (3): 444-455.

• 图形与图像 • 上一篇    下一篇

面向农业地块提取的边缘-语义协同双分支解码网络

杨梅,刘司南,潘臻,高磊,闵帆   

  1. (1.西南石油大学计算机与软件学院,四川 成都 610500;
    2.成都师范学院计算机科学学院,四川 成都 611130)

  • 出版日期:2026-03-25 发布日期:2026-03-25
  • 基金资助:
    国家重点研发计划(2021YFB3900905);国家自然科学基金(42471394,42071316);南充市-西南石油大学市校科技战略合作专项(23XNSYSX0084,23XNSYSX0062)

Edge and semantic collaborative dual-branch decoding network for agricultural parcel extraction

YANG Mei,LIU Sinan,PAN Zhen,GAO Lei,MIN Fan   

  1. (1.School of Computer Science and Software Engineering,Southwest Petroleum University,Chengdu 610500;
    2.School of Computer Science,Chengdu Normal University,Chengdu 611130,China)
  • Online:2026-03-25 Published:2026-03-25

摘要: 面向农业资源监测的遥感影像农业地块精准提取是实现耕地资源智能化管理的关键技术。针对现有深度学习方法在复杂农田场景中面临的边界模糊、纹理多样及形态异构导致的分割精度不足问题,提出边缘与语义协同优化的多任务神经网络ESDNet,通过3种关键策略实现性能提升。首先,在编码器与主解码器间嵌入坐标注意力(CA)模块,通过坐标敏感的注意力权重增强模糊边界的鉴别能力;其次,设计具有多级感受野的特征增强(FE)模块,采用金字塔空洞卷积与自适应特征融合策略提升网络对异质纹理的解析度;最后,构建边界映射、距离映射与掩膜映射的多任务协同优化框架,通过几何约束与语义引导的联合学习策略,强化对复杂形态地块的空间认知。为验证网络普适性,实验选取中国山东、四川及荷兰地区的高分二号、哨兵二号多源遥感影像构建测试集。结果表明,ESDNet在交并比IoU指标上分别提升0.77个百分点、2.17个百分点和2.28个百分点,优于现有最优网络,其展现出的强泛化能力和高精度分割特性,为智慧农业中的耕地资源动态监测提供了可靠的技术支撑。


关键词: 农业地块提取, 遥感, 语义分割, 神经网络, 多任务学习

Abstract: Accurate agricultural parcel extraction from remote sensing images for agricultural resource monitoring is a critical technology for achieving intelligent management of cultivated land resources. To address the insufficient segmentation accuracy caused by blurred boundaries, diverse textures, and morphological heterogeneity in complex farmland scenarios in existing deep learning methods, this paper proposes a multi-task neural network ESDNet featuring collaborative edge-semantic optimization. The model achieves performance improvements through three innovative mechanisms: Firstly, a coordinate attention (CA) module is embedded between the encoder and main decoder to enhance the discriminative capability for ambiguous boundaries through coordinate-sensitive attention weighting. Secondly, a feature enhancement (FE) module with multi-level receptive fields is designed, employing pyramid dilated convolutions and adaptive feature fusion strategies to improve the model's resolution of heterogeneous textures. Thirdly, a multi-task collaborative optimization framework inte- grating boundary mapping, distance mapping, and mask mapping is constructed, reinforcing spatial cognition of morphologically complex parcels via a joint learning strategy combining geometric constraints and semantic guidance. To validate the model's generalizability, experiments were conducted on multi-source remote sensing datasets (Gaofen-2 and Sentinel-2 imagery) covering Shandong and Sichuan regions in China and the Netherlands. Results demonstrate that ESDNet achieves superior performance, surpassing state-of-the-art models by 0.77 percentage points, 2.17 percentage points, and 2.28 percentage points in intersection over union (IoU) across the three regions, respectively. The model’s strong generalization capability and high-precision segmentation characteristics provide reliable technical support for dynamic monitoring of cultivated land resources in smart agriculture.

Key words: agricultural parcel extraction, remote sensing, semantic segmentation, neural network, multi-task learning