• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

J4 ›› 2015, Vol. 37 ›› Issue (01): 168-172.

• 论文 • Previous Articles     Next Articles

Two-class text categorization using nearest subspace search 

LI Yujian,WANG Ying,LENG Qiangkui   

  1. (College of Computer Science,Beijing University of Technology,Beijing 100124,China)
  • Received:2013-04-26 Revised:2013-07-03 Online:2015-01-25 Published:2015-01-25

Abstract:

The nearest neighbor search algorithm is a simple method with high accuracy in text categorization, but it usually requires large amounts of calculation in the classifying process. To overcome this disadvantage, a twoclass text categorization method is proposed based on the nearest subspace search. It extracts a feature subspace from samples in the same class, and maps it to a point in a higher dimensional space, in which the classifying process is carried out by nearest neighbor search. Experiments on Reuters-21578 data sets show that the proposed method can effectively improve the performance of nearest neighbor search in text categorization, achieving a higher precision, recall rate, and F1 values.

Key words: text categorization;nearest subspace search;nearest neighbor search