J4 ›› 2014, Vol. 36 ›› Issue (05): 971-976.
• Articles •
TANG Shouzhong, QI Jiandong
Abstract:
A new vector space model is proposed that uses both keywords and co-occurrence word pairs as the representation features of documents. First, keyword candidates are extracted from the documents by segmenting the text and removing stop words, and the keyword features are then filtered by document frequency. Second, co-occurrence word pairs are constructed from the retained keyword features, and support and confidence measures are defined to filter the co-occurrence word-pair features. Finally, the keyword features and the co-occurrence word-pair features are combined to construct the vector space model. Text classification experiments show that the proposed model performs better at text classification.
Key words: vector space model; co-occurrence word; semantic relationship; text classification
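The three steps described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the exact definitions of support and confidence, so the sketch assumes association-rule style measures (support as the fraction of documents containing both words, confidence as the smaller of the two directional confidences). The stop-word list, the whitespace tokenizer standing in for a word segmenter, the threshold values, and the function names `tokenize`, `build_features`, and `vectorize` are all hypothetical.

```python
from collections import Counter
from itertools import combinations

# Hypothetical stop-word list; the paper's actual list is not given in the abstract.
STOP_WORDS = {"the", "a", "of", "and", "is", "in", "to"}


def tokenize(text):
    """Whitespace tokenizer standing in for a real word segmenter."""
    return [w for w in text.lower().split() if w not in STOP_WORDS]


def build_features(docs, min_df=2, min_support=0.5, min_confidence=0.6):
    """Select keyword features and co-occurrence word-pair features.

    All thresholds are illustrative assumptions, not values from the paper.
    """
    token_sets = [set(tokenize(d)) for d in docs]
    n_docs = len(token_sets)

    # Step 1: keep keywords whose document frequency reaches min_df.
    df = Counter(w for s in token_sets for w in s)
    keywords = sorted(w for w, c in df.items() if c >= min_df)
    kept = set(keywords)

    # Step 2: count, per document, pairs of kept keywords that co-occur.
    pair_df = Counter()
    for s in token_sets:
        pair_df.update(combinations(sorted(s & kept), 2))

    pairs = []
    for (a, b), c in sorted(pair_df.items()):
        support = c / n_docs                 # assumed definition
        confidence = c / max(df[a], df[b])   # assumed: min of both directions
        if support >= min_support and confidence >= min_confidence:
            pairs.append((a, b))
    return keywords, pairs


def vectorize(text, keywords, pairs):
    """Step 3: concatenate keyword counts with binary pair indicators."""
    counts = Counter(tokenize(text))
    present = set(counts)
    keyword_part = [counts[w] for w in keywords]
    pair_part = [1 if a in present and b in present else 0 for a, b in pairs]
    return keyword_part + pair_part
```

Calling `build_features` on a training corpus returns the two feature lists, and `vectorize` then maps any document onto a single vector whose first components are keyword counts and whose remaining components flag the retained word pairs. Raw counts and binary indicators are used here only for brevity; a weighting scheme such as TF-IDF could be substituted without changing the structure.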
TANG Shouzhong,QI Jiandong. Vector space model based on keywords and cooccurrence word pairs [J]. J4, 2014, 36(05): 971-976.
URL: http://joces.nudt.edu.cn/EN/
http://joces.nudt.edu.cn/EN/Y2014/V36/I05/971