• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science ›› 2023, Vol. 45 ›› Issue (11): 1991-1998.

• Graphics and Images • Previous Articles     Next Articles

A multi-scale feature fusion network based fast CU partitioning in HEVC intra coding

LIU Yu-mo1,2,LIU Jian-fei1,2,HAO Lu-guo3,ZENG Wen-bin4   

  1. (1.School of Electronic and Information Engineering,Hebei University of Technology,Tianjin 300131;
    2.National Experimental Teaching Demonstration Center of Electronic and 
    Communication Engineering,Hebei University of Technology,Tianjin 300131;
    3.School of Information Engineering,Guangdong University of Technology,Guangzhou 510006;
    4.School of Electrical and Information Engineering,Tianjin University,Tianjin 300072,China)
  • Received:2022-07-18 Revised:2023-02-13 Accepted:2023-11-25 Online:2023-11-25 Published:2023-11-16

Abstract: High Efficiency Video Coding (HEVC) significantly improves the coding efficiency but increases the coding complexity, especially in the process of coding unit (CU) partitioning based on quadtree structure, so it is important to study the fast CU partitioning. A multi-scale feature fusion network can achieve fast HEVC CU partitioning. Therefore, the UcuNet network structure is designed by combining the U-Net and CU partitioning features. Meanwhile, asymmetric convolutional AC and CBAM attention mechanisms are used to enhance the feature extraction of pixels at different scales. In order to sufficiently  train the deep learning model, the original video with different resolutions and the corresponding encoding information are collected to build a large-scale dataset. Finally, the model is embedded into the HEVC coding architecture to predict the result of CU partitioning in advance, which can effectively reduce the coding complexity caused by CU partitioning by eliminating the recursive rate distortion optimization (RDO) calculation process in the original CU partitioning method. Compared with the official HEVC test model (HM16.20), the proposed UcuNet reduces the average coding time by 68.13% while BD-BR is only decreased by 2.63%.


Key words: high efficiency video coding (HEVC), coding unit partitioning, deep learning, asymmetric convolution