• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science

Previous Articles     Next Articles

Multi-document summarization extraction based
on multi-information sentence graph model

JIANG Ya-fang1,2,YAN Xin1,2,XU Guang-yi3,ZHOU Feng1,2,DENG Zhong-ying1,2   

  1. (1.Faculty of Information Engineering and Automation, Kunming University of Science and Technology,Kunming 650500;
    2.Yunnan Key Laboratory of Artificial Intelligence,Kunming University of Science and Technology,Kunming 650500;
    3.Yunnan Nantian Electronic Information Industry Co.,Ltd.,Kunming  650041,China)
     
  • Received:2019-07-01 Revised:2019-09-11 Online:2020-03-25 Published:2020-03-25

Abstract:

In view of the problem that the existing multi-document extraction method cannot make good use of sentence topic information and semantic information, this paper proposes a multi-document summarization extraction method that integrates multi-information sentence graph model. Firstly, a sentence graph model with sentences as nodes is constructed. Secondly, the Bayesian topic model based on sentences and the word vector model are combined to get the probability distribution of sentence topic and the semantic similarity of sentences, and the final relevance of sentences is obtained. The topic information and semantic information are used as the edge weights of the sentence graph model. Finally, the summary of the multi-document is described by the summary method of the minimum dominance set of the sentence graph. This method combines the topic information, semantic information and relationship information between sentences by integrating the multi-information sentence graph model. The experimental results show that the method can effectively improve the comprehensive performance of the summarization extraction.
 

Key words: multi-document summarization, sentence Bayesian theme model, word vector, sentence graph model, minimum dominating set