• Journal of the China Computer Federation
  • Chinese Science and Technology Core Journal
  • Chinese Core Journal

J4 ›› 2013, Vol. 35 ›› Issue (9): 146-150.

• Articles •

Triphone models of Lhasa Tibetan based on decision tree      

LI Guanyu,YU Hongzhi,LI Yonghong,MA Ning   

  1. (Key Laboratory for Chinese Ethnic Minority Language of Ministry of Education,
    Northwest University for Nationalities, Lanzhou 730030, China)
  • Received: 2013-04-16 Revised: 2013-07-26 Online: 2013-09-25 Published: 2013-09-25

Abstract:

The probability distributions of monophones and triphones in Lhasa Tibetan are calculated, and the need for a context-dependent acoustic model in automatic speech recognition (ASR) for Lhasa Tibetan is analyzed. The phoneme is chosen as the basic unit for acoustic modeling. Based on the characteristics of Tibetan, a pronunciation dictionary built on single syllables is established. The main issues and algorithms for decision-tree-based triphone models are discussed. Based on IPA symbols and the characteristics of the Lhasa dialect, 38 phoneme subsets and question sets for triphone modeling are established. A total of 8170 sentences from 20 speakers were recorded to train the models. Context-dependent continuous Hidden Markov Models (HMMs) based on triphones are built and trained on the HTK platform. The recognition results under different numbers of states and mixtures are analyzed. Finally, a framework for large-vocabulary continuous speech recognition of the Lhasa dialect is established.

Key words: Tibetan; Lhasa dialect; LVCSR; HMM; triphone model
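
The abstract mentions calculating monophone and triphone distributions as the motivation for context-dependent modeling. The following is a minimal sketch of how such counts might be obtained, not the authors' implementation. It assumes phone-level transcriptions are already available as lists of symbols, and it uses the HTK-style label convention left-centre+right. The function names and the sample phone sequences are hypothetical placeholders, not actual Lhasa Tibetan data.

```python
from collections import Counter


def to_triphones(phones):
    """Expand a phone sequence into HTK-style context labels (left-centre+right).

    Phones at the sequence edges have only one neighbour, so they
    receive a one-sided (biphone) label.
    """
    labels = []
    for i, phone in enumerate(phones):
        left = phones[i - 1] + "-" if i > 0 else ""
        right = "+" + phones[i + 1] if i < len(phones) - 1 else ""
        labels.append(left + phone + right)
    return labels


def relative_frequencies(items):
    """Return a dict mapping each distinct item to its share of the total count."""
    counts = Counter(items)
    total = sum(counts.values())
    return {item: count / total for item, count in counts.items()}


# Hypothetical phone sequences (placeholders, not real transcriptions).
utterances = [["k", "a", "p", "a"], ["p", "a", "k", "a"]]

monophone_dist = relative_frequencies(p for u in utterances for p in u)
triphone_dist = relative_frequencies(
    t for u in utterances for t in to_triphones(u)
)
```

Even in this toy input, 3 distinct monophones give rise to 8 distinct context-dependent labels. This kind of growth leaves many triphones with little training data, which is the usual reason for tying states with a decision tree.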