• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊
论文

Small Sample and SemiSupervized Learningfor Finding Similar Samples

Expand
  • (School of Information Science and  Technology,Southwest Jiaotong University,Chengdu 610031,China)

Received date: 2010-03-13

  Revised date: 2010-06-10

  Online published: 2010-09-02

Abstract

Traditional approach for building text classifiers requires a large number of labeled documents for training a good text classifier. For reallife text classification applications, it is difficult to obtain a large number of labeled documents, so how to get a better result with these labeled and unlabeled documents has become a hot research topic. This paper proposes a new method to expand the set of labeled documents, first we extract a set of representative features from the labeled documents, then according to these representative features we choose the similar samples and add them to the labeled documents. The experiments prove that the method can effectively improve the classification results.

Cite this article

QIN Fei,YANG Yan . Small Sample and SemiSupervized Learningfor Finding Similar Samples[J]. Computer Engineering & Science, 2010 , 32(9) : 127 -129 . DOI: 10.3969/j.issn.1007130X.2010.

Outlines

/