• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

J4 ›› 2010, Vol. 32 ›› Issue (5): 133-135.doi: 10.3969/j.issn.1007130X.2010.

• 论文 • Previous Articles     Next Articles

Study on New Words of Web Based on Statistical Word Segmentation

ZHANG Min,WANG Chunhong   

  1. (Department of Computer Science and Technology,Yuncheng University,Yuncheng 044000,China)
  • Received:2009-09-13 Revised:2009-11-10 Online:2010-04-28 Published:2010-05-11

Abstract: This paper analyzes the various segmentation methods in the information processing technology.In view of the current segmentation methods in the network which do not recognize the new emerging words,we design a new subword method based on statistics. This method avoids complex grammar and rules, needs no enormous support from dictionaries, and resolves the problems brought by the new words. So we conclude that this method has better exactness and is very pragmatic and powerful in practical operations.

Key words: web;statistical word segmentation;dictionary;feature selection

CLC Number: