• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

J4 ›› 2014, Vol. 36 ›› Issue (10): 2009-2013.

• 论文 • Previous Articles     Next Articles

An automatic phoneme segmentation method in continuous
Tibetan language under the condition of resourcedeficiency         

LI Guanyu,YU Hongzhi,WU Zhiqiang   

  1. (Key Laboratory for Chinese Ethnic Minority Language of Ministry of Education,
    Northwest University for Nationalities,Lanzhou 730030,China)
  • Received:2014-06-19 Revised:2014-07-18 Online:2014-10-25 Published:2014-10-25

Abstract:

Phoneme segmentation is often necessary in research of Tibetan TTS or phonetics.Artificial segmentation is a hard job and timeconsuming.The acoustic model of Tibetan language is not precise or robust enough because of resourcedeficiency.Therefore, it is not precise enough when the method of autosegmentation is adopted.Lhasa dialect of Tibetan is chosen as the study object.Phone set and dictionary of Tibetan are established.Common phones are obtained on the basis of distance between phone models. GMMHMM models of English and Lhasa Tibetan are fused.Silences and short pauses are autojudged.Words network is established and then expanded to be a models (or monophones) network.All frames of parameters are segmented and aligned to sates of models by using Viterbi algorithm.Experiments demonstrate that phones are segmented and the result is better than the method of using pure Tibetan models.

Key words: Tibetan;Lhasa dialect;automatic phoneme segmentation;Viterbi;HMM