J4 ›› 2012, Vol. 34 ›› Issue (6): 187-190.
• 论文 • Previous Articles
CAI Zangtai
Received:
Revised:
Online:
Published:
Abstract:
The boundary Ientification of Tibetan sentence is the basical research of Tibetan text analysis. It is the essential work to build a Parallel Corpora between Tibetan and other languages, and also it is the base to do TibetanChinese machine translation. The article raises the ways of Boundary Identification of Tibetan sentences through the analyze of the ending forms of Tibetan sentences and the study of it’s boundary rules. The method is firstly using the special rules and word forms to identify Tibetan Sentences, and then to make a further identification for those ambiguous sentences by using Maximum Entropy Model. So it can improve the boundary identification rate of Tibetan sentences.
Key words: Tibetan sentence;boundary identification;maximum entropy model
CAI Zangtai. Research on the Automatic Identification of Tibetan Sentence Boundaries with Maximum Entropy Classifier[J]. J4, 2012, 34(6): 187-190.
0 / / Recommend
Add to citation manager EndNote|Ris|BibTeX
URL: http://joces.nudt.edu.cn/EN/
http://joces.nudt.edu.cn/EN/Y2012/V34/I6/187