A 3D human pose estimation method integrating semantic graph convolutional network and self-attention mechanism

Computer Engineering & Science ›› 2026, Vol. 48 ›› Issue (3): 521-530.

• Graphics and Images • Previous Articles Next Articles

A 3D human pose estimation method integrating semantic graph convolutional network and self-attention mechanism

TONG Lijing,YING Yizhuo,CAO Nan

(School of Artificial Intelligence and Computer Science,North China University of Technology,Beijing 100144，China)

Online:2026-03-25 Published:2026-03-25

Abstract

Abstract: Aiming at the problem that it is difficult to capture the global characteristics of human joint sequences and the estimation accuracy is not high, a 3D human pose estimation method combining semantic graph convolutional network and self-attention mechanism is proposed. Firstly, in order to improve the feature extraction effect in the process of mapping from two-dimensional human pose sequence to three-dimensional human pose sequence, self-attention mechanism is integrated into semantic graph convolutional network to carry out spatial feature extraction based on the integration of local features and global features. Secondly, the channel-mixing module of the MLP-Mixer network is improved by introducing a semantic graph convolutional network and a U-shaped MLP structure for temporal feature extraction. Finally, 3D human pose estimation is performed based on the fused features from 2D human images and the extracted temporal features. Experimental evaluations on the Human3.6M dataset for 3D human pose estimation demonstrate that, compared with current mainstream 3D human pose estimation methods, the proposed method reduces the average error metrics MPJPE and PA-MPJPE by approximately 4.5 mm and 0.2 mm compared with the suboptimal method, respectively. The experimental results validate the effectiveness of the proposed method.

Key words: 3D human pose estimation;semantic graph convolutional network;MLP-Mixer model;self-attention , mechanism

TONG Lijing, YING Yizhuo, CAO Nan. A 3D human pose estimation method integrating semantic graph convolutional network and self-attention mechanism[J]. Computer Engineering & Science, 2026, 48(3): 521-530.

[1]	WANG Jing, MA Huifang, ZHANG Mengyuan. Knowledge concept-aware session modeling for knowledge tracing [J]. Computer Engineering & Science, 2026, 48(1): 180-190.
[2]	ZHENG Mingjie, CAO Zhanmao. A hierarchical decoder model with attention collaboration mechanism for solving the heterogeneous capacitated vehicle routing problem [J]. Computer Engineering & Science, 2025, 47(9): 1669-1678.
[3]	ZHANG Feng1, SHAO Yubin1, DU Qingzhi1, LONG Hua1, MA Dinan2. Multimodal aspect-based sentiment analysis based on dual channel graph convolutional network [J]. Computer Engineering & Science, 2025, 47(7): 1321-1330.
[4]	LIN Yi1, 2, 3, SONG Huihui1, 2, 3. A pyramid feature decoupling extraction fusion network for pansharpening [J]. Computer Engineering & Science, 2025, 47(7): 1262-1273.
[5]	CHEN Junyan1, LI Xinmei1, ZHU Changhong2, XIAO Wei3. A routing optimization algorithm for software-defined optical transport network based on multi-view graph attention mechanism [J]. Computer Engineering & Science, 2025, 47(7): 1193-1204.
[6]	LIU Xiang, LI Chuankun, GUO Jinming, LIU Yu. Environmental sound classification based on spatial attention mechanism and multi-feature data enhancement [J]. Computer Engineering & Science, 2025, 47(11): 2038-2044.
[7]	LI Hang, CHEN Zhigang, WANG Yijie, ZHANG Xinyu, LEI Jinghong, LIU Lingfeng. Research on human pose anomaly detection based on spatio temporal graph attention state space model [J]. Computer Engineering & Science, 2025, 47(10): 1830-1840.

A 3D human pose estimation method integrating semantic graph convolutional network and self-attention mechanism

PDF

Knowledge

Abstract

Cite this article

share this article

Related Articles 7

Recommended Articles

Metrics

Comments