• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science ›› 2021, Vol. 43 ›› Issue (04): 712-720.

Previous Articles     Next Articles

An image semantic segmentation method based on path aggregation Atrous convolutional network

LI Shu-ao,XIE Qing,MA Yan-chun,LIU Yong-jian   

  1. (School of Computer Science and Technology,Wuhan University of Technology,Wuhan 430070,China)
  • Received:2020-01-15 Revised:2020-06-18 Accepted:2021-04-25 Online:2021-04-25 Published:2021-04-21

Abstract: The deep full convolutional neural network based on encoder-decoder structure has made significant progress in image semantic segmentation. However, the path of transferring low-level positioning information in the deep network to the high-level network is too long, which makes it difficult to use low-level positioning information in the decoder stage to restore the boundary structure of the object. Aiming at this problem, a path aggregation structure used in the decoder part of segmentation network is proposed. This structure shortens the propagation path of low-level information to high-level information in the segmentation network and provides multi-scale contextual semantic information, so that the segmentation network can produce more refined boundary segmentation results. Aiming at the pro- blem that the softmax cross-entropy loss function often used in semantic segmentation is insufficient to distinguish samples with similar appearance, this paper reforms the softmax cross-entropy loss function and proposes a bidirectional cross-entropy loss function. Combining the proposed path aggregation Atrous convolutional network with the new loss function method can obtain better results on the PASCAL VOC2012Aug data set, which increases the mIoU value from 78.77% to 80.44%.


Key words: semantic , image segmentation;bidirectional cross-entropy;path aggregation structure;multi-scale prediction;deep learning