• Journal of the China Computer Federation
  • Chinese Science and Technology Core Journal
  • Chinese Core Journal
Article

A Decomposition Algorithm for Words Components in the Tibetan Word Frequency Statistics System

  • (Tibetan Intellectual Information Processing Centre, Qinghai Normal University, Xining 810008, China)

Received date: 2010-03-25

  Revised date: 2010-06-03

  Online published: 2011-03-25

Abstract

Tibetan word frequency statistics is basic work for Tibetan information processing. Tibetan words are built from components combined in both the vertical and horizontal directions, so decomposing Tibetan words into their components is the foundation for compiling attribute statistics of this alphabetic script. Based on the development of a Tibetan word frequency statistics system, this paper proposes a decomposition algorithm for Tibetan words. Preliminary experiments show that the algorithm is simple and feasible and can effectively determine the position of each basic component.
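The abstract does not reproduce the algorithm itself, so the following is only a minimal sketch of what horizontal and vertical decomposition can look like, not the authors' method. It assumes the input is Unicode-encoded Tibetan, where a code point in the base-consonant range starts a new horizontal position and subjoined consonants and vowel signs stack vertically on it. The helper names `classify` and `split_stacks` are hypothetical, and the sketch does not attempt to label grammatical roles such as prefix, root, or suffix.

```python
import unicodedata


def classify(ch):
    """Return a coarse structural role for one Tibetan code point."""
    cp = ord(ch)
    if 0x0F40 <= cp <= 0x0F6C:      # base (full-size) consonants
        return "base"
    if 0x0F90 <= cp <= 0x0FBC:      # subjoined consonants, written below a base
        return "subjoined"
    if 0x0F71 <= cp <= 0x0F7D or cp in (0x0F80, 0x0F81):  # dependent vowel signs
        return "vowel"
    return "other"


def split_stacks(syllable):
    """Group code points into horizontal stacks.

    Each stack is a list of (character, role) pairs in encoding order:
    the base consonant first, then whatever attaches to it vertically.
    """
    stacks = []
    for ch in syllable:
        role = classify(ch)
        # A base consonant opens a new horizontal position; any other
        # code point attaches to the stack that is currently open.
        if role == "base" or not stacks:
            stacks.append([])
        stacks[-1].append((ch, role))
    return stacks


if __name__ == "__main__":
    # The syllable "bsgrubs", written with escapes to avoid encoding issues.
    word = "\u0F56\u0F66\u0F92\u0FB2\u0F74\u0F56\u0F66"
    for h, stack in enumerate(split_stacks(word)):
        for v, (ch, role) in enumerate(stack):
            name = unicodedata.name(ch, "UNKNOWN")
            print(f"horizontal={h} vertical={v} role={role} {name}")
```

Under these assumptions, each component gets a pair of indices: its horizontal position within the word and its vertical position within the stack. A frequency statistics system could then count components per position, although the rules the paper actually applies may differ.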

Cite this article

CAI Rang Zhuo Ma, CAI Zhi Jie. A Decomposition Algorithm for Words Components in the Tibetan Word Frequency Statistics System[J]. Computer Engineering & Science, 2011, 33(3): 159-162. DOI: 10.3969/j.issn.1007130X.2011.

