[1] Qin Y,Carlini N,Cottrell G,et al.Imperceptible,robust,and targeted adversarial examples for automatic speech recognition[C]∥Proc of International Conference on Machine Learning,2019:5231-5240.
[2] Prasad A,Jyothi P,Velmurugan R.An investigation of end-to-end models for robust speech recognition[C]∥Proc of 2021 IEEE International Conference on Acoustics,Speech and Signal Processing,2021:6893-6897.
[3] Zhang H,Liu C,Inoue N,et al.Multi-task autoencoder for noise-robust speech recognition[C]∥Proc of 2018 IEEE International Conference on Acoustics,Speech and Signal Processing,2018:5599-5603.
[4] Hu H,Tan T,Qian Y.Generative adversarial networks based data augmentation for noise robust speech recognition[C]∥Proc of 2018 IEEE International Conference on Acoustics,Speech and Signal Processing,2018:5044-5048.
[5] Lee K F,Hon H W,Reddy R.An overview of the SPHINX speech recognition system[J].IEEE Transactions on Acoustics,Speech,and Signal Processing,1990,38(1):35-45.
[6] Graves A,Fernández S,Gomez F,et al.Connectionist temporal classification:Labelling unsegmented sequence data with recurrent neural networks[C]∥Proc of the 23rd International Conference on Machine Learning,2006:369-376.
[7] Chorowski J K,Bahdanau D,Serdyuk D,et al.Attention-based models for speech recognition[C]∥Proc of the 28th International Conference on Neural Information Processing Systems,2015:577-585.
[8] Amodei D,Ananthanarayanan S,Anubhai R,et al.Deep speech 2:End-to-end speech recognition in English and Mandarin[C]∥Proc of International Conference on Machine Learning,2016:173-182.
[9] Kannan A,Wu Y,Nguyen P,et al.An analysis of incorporating an external language model into a sequence-to-sequence model[C]∥Proc of 2018 IEEE International Conference on Acoustics,Speech and Signal Processing,2018:5824-5828.
[10] Gulati A,Qin J,Chiu C C,et al.Conformer:Convolution-augmented transformer for speech recognition[C]∥Proc of the 21st Annual Conference of the International Speech Communication Association,2020:5036-5040.
[11] Yao Z,Wu D,Wang X,et al.WeNet:Production oriented streaming and non-streaming end-to-end speech recognition toolkit[C]∥Proc of the 22nd Annual Conference of the International Speech Communication Association,2021:4054-4058.
[12] Li Sen.Design and implementation of noise robust speech recognition algorithm based on deep learning[D].Chengdu:University of Electronic Science and Technology of China,2021.(in Chinese)
[13] Moore A H,Xue W,Naylor P A,et al.Noise covariance matrix estimation for rotating microphone arrays[J].IEEE/ACM Transactions on Audio,Speech,and Language Processing,2018,27(3):519-530.
[14] Hsu W N,Zhang Y,Glass J.Unsupervised domain adaptation for robust speech recognition via variational autoencoder-based data augmentation[C]∥Proc of 2017 IEEE Automatic Speech Recognition and Understanding Workshop,2017:16-23.
[15] Liu B,Nie S,Liang S,et al.Jointly adversarial enhancement training for robust end-to-end speech recognition[C]∥Proc of the 20th Annual Conference of the International Speech Communication Association,2019:491-495.
[16] Pujari S,Sneha S K,Vinusha R,et al.A survey on deep learning based lip-reading techniques[C]∥Proc of the 3rd International Conference on Intelligent Communication Technologies and Virtual Mobile Networks,2021:1286-1293.
[17] Makino T,Liao H,Assael Y,et al.Recurrent neural network transducer for audio-visual speech recognition[C]∥Proc of 2019 IEEE Automatic Speech Recognition and Understanding Workshop,2019:905-912.
[18] MacKenzie I S,Soukoreff R W.A character-level error analysis technique for evaluating text entry methods[C]∥Proc of the 2nd Nordic Conference on Human-Computer Interaction,2002:243-246.
Appendix (Chinese-language reference in the original):
[12] 李森.基于深度学习的噪声鲁棒性语音识别算法设计与实现[D].成都:电子科技大学,2021.