Computer Engineering & Science
Previous Articles Next Articles
ZHOU Yan,Shereb Dorje
Received:
Revised:
Online:
Published:
Abstract:
Research on Tibetan voiceprint recognition technology has just started, and it is an urgent and necessary task to establish a corpus. We design and build a corpus based on the characteristics of Tibetan language, which consists of two parts: textdependent part and textindependent part. Texts of the corpus are collected from a variety of materials, including newspaper, literature, education, science and technology, Buddhism, and history and traditional culture. As for the recording part, we invite 50 speakers from different regions of Tibet. The corpus contains 9500 speech files and it lays a certain foundation for Tibetan voiceprint recognition.
Key words: Tibetan, voiceprint recognition, corpus
ZHOU Yan,Shereb Dorje. Corpus construction for Tibetan voiceprint recognition[J]. Computer Engineering & Science.
0 / / Recommend
Add to citation manager EndNote|Ris|BibTeX
URL: http://joces.nudt.edu.cn/EN/
http://joces.nudt.edu.cn/EN/Y2018/V40/I11/2080