• Journal of the China Computer Federation
  • Chinese Science and Technology Core Journal
  • Chinese Core Journal

J4 ›› 2015, Vol. 37 ›› Issue (12): 2318-2323.

• Papers •

Study of modern Uyghur word stem POS tag set    

Azragul(1,2), Mirxat(3), Yusup·Abaydula(1)

  (1. School of Computer Science & Technology, Xinjiang Normal University, Urumqi 830054;
    2. The Xinjiang Technical Institute of Physics & Chemistry, Chinese Academy of Sciences, Urumqi 830011;
    3. School of Information Science and Engineering, Xinjiang University, Urumqi 830046, China)
  • Received: 2015-08-11 Revised: 2015-10-15 Online: 2015-12-25 Published: 2015-12-25

Abstract:

Taking the word stem POS tagging of the Uyghur language textbooks currently used in primary schools as the verification object, we validate the feasibility, adaptability and reliability of the Modern Uyghur Word Stem POS Tag Set, which was designed from the perspective of combining grammar and semantics. We first describe the electronic corpus of primary school Uyghur language textbooks. We then discuss the basic situation of the standard "Part-of-speech and tagging set standards of modern Uyghur word stem information processing", together with the design and algorithm of a multi-strategy modern Uyghur word stem tagging system model. Finally, we analyse the experimental results, validate the soundness of the Modern Uyghur Word Stem POS Tag Set, supplement and correct parts of its semantic classification and codes, and recommend a substantial expansion of the standard.

 

Key words: modern Uyghur word stem; POS tagging; tag set; verification