Computer Engineering & Science ›› 2024, Vol. 46 ›› Issue (06): 1121-1127.
• Artificial Intelligence and Data Mining •
CHEN Lu1,2,DONG Ling1,2,WANG Wen-jun1,2,WANG Jian1,2,YU Zheng-tao1,2,GAO Sheng-xiang1,2
Abstract: Burmese speech recognition output contains many homophone and spacing errors. General-purpose methods correct erroneous characters using the semantic information of the text, but they locate and correct Burmese spacing and homophone errors inaccurately. Because Burmese is a tonal language whose tone information is embedded in its phonemes, this paper proposes a phoneme-fused method for correcting errors in Burmese speech recognition text. A parameter-sharing strategy jointly models the transcribed text and its phonemes, and the phoneme information assists in detecting and correcting Burmese homophone and spacing errors. Experimental results show that, compared with the ConvSeq2Seq method, the proposed method raises the F1 score on the Burmese speech recognition correction task by a relative 85.97%, to 79.15%.
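The two figures in the abstract can be cross-checked with a short calculation. The sketch below assumes the 85.97% figure is a relative increase over the ConvSeq2Seq baseline, since an absolute gain of that size could not end at 79.15%. The implied baseline value is inferred here and is not stated in the abstract, and the function name is hypothetical.

```python
def implied_baseline(final_score: float, relative_gain_pct: float) -> float:
    """Return the baseline score implied by a final score and a relative gain (in percent)."""
    return final_score / (1.0 + relative_gain_pct / 100.0)


proposed_f1 = 79.15    # F1 of the proposed method, as reported in the abstract
relative_gain = 85.97  # reported increase, read here as relative (an assumption)

# Inferred value, not a figure taken from the paper.
baseline_f1 = implied_baseline(proposed_f1, relative_gain)
print(f"Implied ConvSeq2Seq F1: {baseline_f1:.2f}%")
```

Under that reading, the ConvSeq2Seq baseline would sit at roughly 42.56% F1.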
Key words: Burmese language, speech recognition text correction, phoneme, shared parameter, bidirectional encoder representations from transformers (BERT)
CHEN Lu, DONG Ling, WANG Wen-jun, WANG Jian, YU Zheng-tao, GAO Sheng-xiang. Text error correction of Burmese speech recognition based on phoneme fusion[J]. Computer Engineering & Science, 2024, 46(06): 1121-1127.
URL: http://joces.nudt.edu.cn/EN/
http://joces.nudt.edu.cn/EN/Y2024/V46/I06/1121