• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science ›› 2022, Vol. 44 ›› Issue (10): 1795-1803.

• Graphics and Images • Previous Articles     Next Articles

A fast paper edge detection method based on cross-layer feature fusion

XU Kun,ZHAO Qi-wen,XU Yuan,LIU You-quan   

  1. (School of Information Engineering,Chang’an University,Xi’an 710021,China)
  • Received:2021-05-12 Revised:2021-06-24 Accepted:2022-10-25 Online:2022-10-25 Published:2022-10-28

Abstract: Combined with the real-time and robust requirements of paper detection in the common paper-pen interaction, a fast paper detection method based on edge detection is proposed. In the edge detection stage, a fast paper edge detection method based on cross-layer feature fusion is advanced.  The linear bottleneck inverted residual blocks and efficient channel B-ECA blocks are added to the HED backbone, which greatly reduce the numbers of parameters and increase the weight of salient features. The features of all stages and all layers are fused in order to retain the more edge features. The high-level features are upsampled and cross-layer fused with the low-level features to solve the problem of edge blur. Training and testing on the self-made MPDS data set shows that, compared with the original HED method, the proposal increases the ODS and OIS by 8.1% and 6.6% respectively, and improves the detection speed from 22.08 FPs to 39.02 FPS. In the paper extraction stage, a paper extraction method based on the paper structure is proposed. After thinning the paper edge based on non-maximum suppression, detection and filtering the line, and extraction paper vertex based on structural constraints, the image containing only paper is extracted. The experimental results show that the paper extraction method can quickly and accurately extract the entire paper image in various complex desktop environments and occlusion situations, which provides an interaction basis for the common paper-pen interaction method.

Key words: paper edge detection, holistically-nested edge detection(HED), channel attention, cross-layer fusion, non-maximum suppression(NMS) ,