Research of Uyghur Language Text Categorization Based on SVM

J4 ›› 2012, Vol. 34 ›› Issue (12): 150-154.

• Articles •

Alimjan AYSA (1,2), Turgun IBRAHIM (2), Kurban OBUL (2), Hasan OMAR (2)

(1. Center of Modern Education Technology, Xinjiang University, Urumqi 830046;
2. College of Information Science and Engineering, Xinjiang University, Urumqi 830046, China)

Received: 2011-12-30  Revised: 2012-03-05  Online: 2012-12-25  Published: 2012-12-25

Abstract

Automatic text categorization is of practical importance and has broad application prospects for improving the effectiveness and accuracy with which text information is used. With the rapid growth of Uyghur-language text on the Internet, Uyghur text categorization has become a key technique for processing and organizing this data. To address the high dimensionality of Uyghur text under the vector space model representation, stemming is used together with the χ² (chi-square) statistic to reduce the dimensionality. A Uyghur text categorizer is then constructed based on SVM. Experimental results on a Uyghur text corpus show that the macro-F1 value of the SVM categorizer can reach 84.6%, outperforming the kNN approach.

Key words: text categorization; SVM; kNN; Uyghur language
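
The two quantities named in the abstract, the χ² statistic used for dimensionality reduction and the macro-F1 used for evaluation, can be illustrated with a minimal sketch. The abstract does not give the paper's exact formulas, so the code below assumes the standard text-categorization forms: χ² computed from a 2x2 table of document counts for one term and one class, and macro-F1 as the unweighted mean of per-class F1. The function names `chi_square` and `macro_f1` and the toy labels are hypothetical and not taken from the paper.

```python
def chi_square(a, b, c, d):
    """Chi-square score of one term for one class (assumed standard form).

    a: documents in the class that contain the term
    b: documents outside the class that contain the term
    c: documents in the class that lack the term
    d: documents outside the class that lack the term
    """
    n = a + b + c + d
    denom = (a + c) * (b + d) * (a + b) * (c + d)
    if denom == 0:
        # Degenerate table (an empty row or column): no evidence either way.
        return 0.0
    return n * (a * d - c * b) ** 2 / denom


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 over all labels seen in either list."""
    labels = sorted(set(y_true) | set(y_pred))
    scores = []
    for label in labels:
        pairs = list(zip(y_true, y_pred))
        tp = sum(1 for t, p in pairs if t == label and p == label)
        fp = sum(1 for t, p in pairs if t != label and p == label)
        fn = sum(1 for t, p in pairs if t == label and p != label)
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        if precision + recall:
            scores.append(2 * precision * recall / (precision + recall))
        else:
            scores.append(0.0)
    return sum(scores) / len(scores)


if __name__ == "__main__":
    # Toy counts and labels for illustration only; not data from the paper.
    print(chi_square(10, 0, 0, 10))
    print(macro_f1(["sport", "sport", "news", "news"],
                   ["sport", "news", "news", "news"]))
```

In a pipeline of the kind the abstract describes, terms (after stemming) would be ranked by a score such as `chi_square` and only the top-ranked ones kept as features before training the classifier; how the paper sets that cutoff is not stated in the abstract.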

Alimjan AYSA, Turgun IBRAHIM, Kurban OBUL, Hasan OMAR. Research of Uyghur Language Text Categorization Based on SVM[J]. J4, 2012, 34(12): 150-154.
