J4 ›› 2012, Vol. 34 ›› Issue (6): 140-145.
• Papers •
WU Shuang, ZHANG Wensheng, XU Hairui
Abstract:
Traditional feature selection algorithms usually use evaluation functions to select the features that distinguish different types of documents. However, these methods build the vector space model with the single word as the unit, so they capture neither the important words in a document nor the relationships between words. To address these shortcomings, a new feature selection algorithm based on the relationships between words is presented. The algorithm takes key words into account, mines associations between words, and checks the resulting association rules with a correlation analysis to produce a feature space that is closely related to the category attributes. Experiments indicate that the method expresses the semantic content of documents better and achieves good categorization results.
Key words: relationship between words; feature selection; association rule; text categorization
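The abstract does not give the mining or checking procedure in detail, so the following is only a rough sketch of the general idea: mine word-pair association rules from tokenized documents, then keep the rules that also pass a correlation check. The function name `mine_word_pairs`, the thresholds, and the use of lift as the correlation measure are all assumptions made for illustration and are not taken from the paper.

```python
from collections import Counter
from itertools import combinations


def mine_word_pairs(docs, min_support=0.5, min_confidence=0.6, min_lift=1.0):
    """Return word-pair rules (x -> y) that pass support, confidence and lift.

    docs is a list of token lists. The thresholds and the choice of lift as
    the correlation check are illustrative assumptions, not the paper's values.
    """
    n = len(docs)
    word_count = Counter()   # number of documents containing each word
    pair_count = Counter()   # number of documents containing each word pair
    for tokens in docs:
        words = sorted(set(tokens))
        word_count.update(words)
        pair_count.update(combinations(words, 2))

    rules = []
    for (a, b), count in pair_count.items():
        support = count / n
        if support < min_support:
            continue
        # Evaluate the rule in both directions: a -> b and b -> a.
        for x, y in ((a, b), (b, a)):
            confidence = count / word_count[x]
            # Lift above 1 means x and y co-occur more often than chance.
            lift = confidence / (word_count[y] / n)
            if confidence >= min_confidence and lift > min_lift:
                rules.append((x, y, support, confidence, lift))
    return rules
```

Under these assumptions, the word pairs that survive could be used as candidate features alongside single words; how the paper weights key words or ties the rules to category attributes is not specified in the abstract and is not modeled here.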
WU Shuang, ZHANG Wensheng, XU Hairui. A Text Feature Selection Algorithm Based on Analysing the Relationship Between Words[J]. J4, 2012, 34(6): 140-145.
URL: http://joces.nudt.edu.cn/EN/
http://joces.nudt.edu.cn/EN/Y2012/V34/I6/140