J4 ›› 2008, Vol. 30 ›› Issue (8): 79-82.
• 论文 • Previous Articles Next Articles
Online:
Published:
Abstract:
This paper analyzes several traditional methods for the Chinese word segmentation, compares the advantages and disadvantages of these methods, and presents a new segmentation algorithm. The method adopts the improved bidirectional Markov chain statistical method to update the word library, and then uses the Reverse Maximum Match method based on the word library and the GameTree search algorithm to cut the Chinese word strings. The experimental results show this algorithm has got better effect on veracity, efficiency and new word distinguishment.
Key words: forward maximum match, reverse maximum match, statistical method, definite finite automation
0 / / Recommend
Add to citation manager EndNote|Ris|BibTeX
URL: http://joces.nudt.edu.cn/EN/
http://joces.nudt.edu.cn/EN/Y2008/V30/I8/79