Low-resource multi-dialect Tibetan synthesis method based on Tibetan character components

Computer Engineering & Science ›› 2025, Vol. 47 ›› Issue (8): 1503-1510.

• Artificial Intelligence and Data Mining •

WANG Jiawen1,2, GAO Dingguo1,2, NI Qiong1,2, BA Guo1,2

(1. College of Information Science and Technology, Tibet University, Lhasa 850000;
2. Tibetan Information Technology Innovative Talent Cultivation Demonstration Base, Tibet University, Lhasa 850000, China)

Received: 2024-02-05 Revised: 2024-05-29 Online: 2025-08-25 Published: 2025-08-27

Abstract

Tibetan speech synthesis is an important research direction in artificial intelligence, with significant implications for the development and innovation of Tibetan language information processing. This paper proposes a corpus processing method based on Tibetan character components that aims to reduce the difficulty of text processing, and uses an end-to-end speech synthesis model to explore two low-resource multi-dialect Tibetan synthesis schemes. Experiments show that the proposed method achieves multi-dialect speech synthesis with a single model trained on mixed datasets, improves the naturalness and expressiveness of the synthesized speech, and reaches an average mean opinion score (MOS) of 4.56 for speech quality.
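
To illustrate what component-level text processing can look like, here is a minimal Python sketch that splits Tibetan text into syllables at the tsheg mark (U+0F0B) and then into individual Unicode code points. It is an assumption for illustration only: the abstract does not specify the paper's actual decomposition rules, and the name `split_components` is hypothetical.

```python
import unicodedata

# Tibetan intersyllabic mark (tsheg), which separates syllables in running text.
TSHEG = "\u0f0b"


def split_components(text):
    """Split Tibetan text into syllables, then each syllable into code points.

    Illustrative assumption: each Unicode code point (base letter, subjoined
    letter, vowel sign) is treated as one "component". The paper's own
    component inventory may differ.
    """
    syllables = [s for s in text.split(TSHEG) if s]
    return [list(s) for s in syllables]


if __name__ == "__main__":
    # Two syllables separated by a tsheg, written with escapes for portability.
    sample = "\u0f56\u0f7c\u0f51\u0f0b\u0f66\u0f90\u0f51"
    for syllable in split_components(sample):
        # Print the Unicode name of every component in the syllable.
        print([unicodedata.name(ch) for ch in syllable])
```

Under this assumption, the symbol inventory a synthesis front end must handle shrinks to the set of individual letters, subjoined letters, and vowel signs, which is far smaller than the set of whole stacked syllables.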

Key words: Tibetan character component, low-resource, multi-dialect, Tibetan, speech synthesis

WANG Jiawen, GAO Dingguo, NI Qiong, BA Guo. Low-resource multi-dialect Tibetan synthesis method based on Tibetan character components[J]. Computer Engineering & Science, 2025, 47(8): 1503-1510.
