Computer Engineering & Science ›› 2026, Vol. 48 ›› Issue (2): 309-318.
• Artificial Intelligence and Data Mining •
ZHANG Hang,WU Jun
Abstract: Extractive summarization suffers from word redundancy and poor readability, while abstractive summarization suffers from semantic confusion, logical inconsistency, and exposure bias. To address these issues, this paper proposes a two-stage hybrid text summarization method based on an improved PEGASUS model and an adaptive error correction mechanism. In the extraction stage, text vectors are obtained with the BERT model and combined with a Bi-GRU and a graph structure, and an improved MMR algorithm reduces redundancy in the candidate summaries, improving summary precision. In the generation stage, the extracted sentences are processed by the PEGASUS model, which incorporates hierarchical clustering and an adaptive error correction mechanism to solve the out-of-vocabulary (OOV) problem. In addition, a contrastive learning framework is adopted to mitigate exposure bias. Experimental results on the NLPCC dataset show that the model built with this method improves ROUGE scores by an average of 2.66, 0.84, and 1.81 percentage points on the respective metrics compared with models built with existing hybrid methods. The method not only improves summary quality but also performs better in resolving the OOV problem and exposure bias.
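The redundancy-reduction step in the extraction stage can be illustrated with the standard Maximal Marginal Relevance (MMR) criterion. The sketch below is not the improved MMR algorithm proposed in the paper, whose details the abstract does not give; it is the textbook greedy form, and the names `cosine`, `mmr_select`, the trade-off weight `lam`, and the toy vectors are all assumptions made for illustration. In practice the sentence vectors would come from an encoder such as BERT.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def mmr_select(sentence_vecs, doc_vec, k, lam=0.7):
    """Greedy standard MMR (hypothetical helper, not the paper's improved variant).

    Each step picks the candidate that maximizes
        lam * sim(sentence, document) - (1 - lam) * max sim(sentence, already selected).
    Returns the indices of the selected sentences in selection order.
    """
    selected = []
    candidates = list(range(len(sentence_vecs)))

    def score(i):
        relevance = cosine(sentence_vecs[i], doc_vec)
        redundancy = max(
            (cosine(sentence_vecs[i], sentence_vecs[j]) for j in selected),
            default=0.0,
        )
        return lam * relevance - (1 - lam) * redundancy

    while candidates and len(selected) < k:
        best = max(candidates, key=score)
        selected.append(best)
        candidates.remove(best)
    return selected


if __name__ == "__main__":
    # Toy vectors: sentences 0 and 1 are near-duplicates, sentence 2 differs.
    vecs = [[1.0, 0.0], [1.0, 0.01], [0.0, 1.0]]
    doc = [1.0, 1.0]
    print(mmr_select(vecs, doc, k=2))
```

With the toy vectors, the second pick is penalized for similarity to the first, so a near-duplicate sentence is passed over in favor of a sentence that covers different content. A larger `lam` favors relevance to the document; a smaller one favors diversity.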
Key words: hybrid summarization, BERT model, PEGASUS model, hierarchical clustering, adaptive error correction mechanism, contrastive learning framework
ZHANG Hang, WU Jun. A two-stage text summarization method based on an improved PEGASUS model and adaptive error correction mechanism[J]. Computer Engineering & Science, 2026, 48(2): 309-318.
URL: http://joces.nudt.edu.cn/EN/
http://joces.nudt.edu.cn/EN/Y2026/V48/I2/309