• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science ›› 2025, Vol. 47 ›› Issue (8): 1503-1510.

• Artificial Intelligence and Data Mining • Previous Articles     Next Articles

Low-resource multi-dialect Tibetan synthesis method based on Tibetan character components

WANG Jiawen1,2,GAO Dingguo1,2,NI Qiong1,2,BA Guo1,2   

  1. (1.College of Information Science and Technology,Tibet University,Lhasa 850000;
    2.Tibetan Information Technology Innovative Talent Cultivation Demonstration Base,Tibet University,Lhasa 850000,China)
  • Received:2024-02-05 Revised:2024-05-29 Online:2025-08-25 Published:2025-08-27

Abstract: Tibetan synthesis is an important research direction in the field of artificial intelligence,which has significant implications for promoting the development and innovation of Tibetan language information processing.This paper proposes a corpus processing method based on Tibetan character components,aiming to reduce the difficulty of text processing,and adopts an end-to-end speech synthesis model to explore two low-resource multi-dialect Tibetan synthesis schemes.The experiments show that the proposed method can achieve multi-dialect speech synthesis with a single model trained on mixed datasets,improve the naturalness and expressiveness of speech,and achieve an average MOS of 4.56 for speech quality.


Key words: Tibetan character component, low-resource, multi-dialect, Tibetan, speech synthesis