A code summarization generation model fusing multi-structure data

Computer Engineering & Science ›› 2024, Vol. 46 ›› Issue (04): 667-675.

• Software Engineering •

YU Tian-ci, GAO Shang

(School of Computer Science, Jiangsu University of Science and Technology, Zhenjiang 212100, China)

Received: 2023-04-03 Revised: 2023-10-13 Accepted: 2024-04-25 Online: 2024-04-25 Published: 2024-04-18

Abstract

Code summaries help developers understand what a piece of code does and how it is implemented. A code summarization model automatically identifies the key information in source code and generates a corresponding summary, improving the readability and maintainability of the code. Existing code summarization models usually represent code using only the structural information of the abstract syntax tree (AST), which limits the quality of the generated summaries. To address this problem, this paper proposes a code summarization model that fuses multi-structure data. First, in addition to the AST, the model represents code with the structural information of the data flow graph. Second, to capture the global information of the code, the model encodes the AST sequence with a Transformer encoder. In addition, the model uses a graph neural network to extract features from the data flow graph, which provide information such as the computational dependencies between variables. Finally, the model fuses the AST features and the data flow features with a cross-modal attention mechanism and generates the corresponding summary with a Transformer decoder. Experimental results show that, compared with six mainstream models, the proposed model achieves higher BLEU, METEOR, and ROUGE-L scores on the Java and Python datasets, and that the generated summaries are highly readable.

Key words: code understanding, code summarization generation, graph neural network, multi-feature fusion, natural language processing
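
As a rough illustration of the two code structures named in the abstract, the sketch below derives an AST node-type sequence and a simplified, variable-level set of data flow edges from a Python snippet using the standard `ast` module. It is not the authors' implementation: the function names `ast_type_sequence` and `data_flow_edges` are hypothetical, the pre-order traversal is an assumed linearization, and the edge rule (each variable read on the right-hand side of a plain assignment points to the assigned variable) is a deliberate simplification of a real data flow graph.

```python
import ast


def ast_type_sequence(source):
    """Return the pre-order sequence of AST node type names (assumed linearization)."""
    tree = ast.parse(source)
    sequence = []

    def visit(node):
        sequence.append(type(node).__name__)
        for child in ast.iter_child_nodes(node):
            visit(child)

    visit(tree)
    return sequence


def data_flow_edges(source):
    """Return (source_variable, target_variable) pairs for plain assignments.

    Simplification: only `name = expression` statements are handled, and an
    edge means "the target's value is computed from this variable".
    """
    tree = ast.parse(source)
    edges = []
    for node in ast.walk(tree):
        if not isinstance(node, ast.Assign):
            continue
        # Variables read anywhere inside the right-hand side expression.
        reads = sorted(
            {n.id for n in ast.walk(node.value) if isinstance(n, ast.Name)}
        )
        for target in node.targets:
            if isinstance(target, ast.Name):
                for variable in reads:
                    edges.append((variable, target.id))
    return edges


if __name__ == "__main__":
    snippet = "def f(a, b):\n    c = a + b\n    d = c * 2\n    return d\n"
    print(ast_type_sequence(snippet))
    print(data_flow_edges(snippet))
```

In a model of the kind the abstract describes, a sequence like the first output would feed the Transformer encoder and edges like the second would define the graph passed to the graph neural network; the encoding and fusion steps themselves are not shown here.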

YU Tian-ci, GAO Shang. A code summarization generation model fusing multi-structure data[J]. Computer Engineering & Science, 2024, 46(04): 667-675.
