• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

J4 ›› 2011, Vol. 33 ›› Issue (3): 159-162.doi: 10.3969/j.issn.1007130X.2011.

• 论文 • Previous Articles     Next Articles

A Decomposition Algorithm for Words Components in the Tibetan Word Frequency Statistics System

CAI Rang Zhuo Ma,CAI Zhi Jie   

  1. (Tibetan Intellectual Information Processing Centre,Qinghai Normal University,Xining 810008,China)
  • Received:2010-03-25 Revised:2010-06-03 Online:2011-03-25 Published:2011-03-25

Abstract:

Tibetan word frequency statistics is a basic work for Tibetan information processing. Tibetan words are combined by the components from the vertical and horizontal directions, therefore, decomposing the  Tibetan words components is the foundation to sum the attributes of such alphabetic writing. This paper is based on the development of the Tibetan word frequency statistics system, proposes a decomposition algorithm for Tibetan words, and the preliminary experiments show that this algorithm is not only simple and feasible, but also can effectively determine the location of each basic components.

Key words: word frequency statistics;component;decomposition