[1] |
Khaldoon A,Ghaidan,Huthaifa A,et al.Artificial intelligence for speech recognition based on neural networks[J].Journal of Signal & Information Processing,2015,6(2):66-72.
|
[2] |
Gavrilescu M. Improved automatic speech recognition system by using compressed sensing signal reconstruction based on L0 and L1 estimation algorithms[C]∥Proc of 2015 7th International Conference on Electronics,Computers and Artificial Intelligence(ECAI),2015:S-23-S-28.
|
[3] |
Yong H,Tian X L,Chen H K,et al.Humanoid and superhuman perception in the AI 2.0 Era:Research summary and trend outlook(English)[J].Frontiers of Information Technology & Electronic Engineering,2017,18(1):58-68.
|
[4] |
Liu Jian-wei, Ding Xi-hao, Luo Xiong-lin. Survey of multimodal deep learning [J]. Application Research of Computers,2020,37(6):1601-1614.(in Chinese)
|
[5] |
Hou Yi-min, Zhou Hui-qiong, Wang Zheng-yi. Overview of speech recognition based on deep learning [J]. Application Research of Computers, 2017,34 (8): 2241-2246. (in Chinese)
|
[6] |
Li Yun-hong, Liang Si-cheng, Jia Kai-li, et al. An improved speech recognition method based on DNN-HMM model [J]. Applied Acoustics, 2019,38 (3): 371-377. (in Chinese)
|
[7] |
Cao Jing-jing, Xu Jie-ping, Shao Sheng-qi. Hierarchical speech recognition model in multi-noise environment [J].Journal of Computer Applications, 2018,38 (6): 1790-1794. (in Chinese)
|
[8] |
Zhang Yu, Ji Zhe, Wan Xin, et al. An experiment of acoustic model adaptation based on deep neural network [J].Journal of Tianjin University (Natural Science and Engineering Technology Edition), 2015,48 (9): 765-770. (in Chinese)
|
[9] |
Wang M, Wang Y, Zhu X J. An improved adaptation algorithm for signer-independent sign language recognition[J]. International Journal of Intelligent Systems Technologies and Applications 2018, 17(4):427-438.
|
[10] |
Qu Dan, Zhang Wen-lin. Speaker adaptation method based on eigenphone speaker subspace for speech recognition [J]. Journal of Electronics & Information Technology, 2015,37 (6): 1350-1356. (in Chinese)
|
[11] |
Jin Chao, Gong Cheng, Li Hui. Speaker adaptation research of neural network acoustic model in speech recognition [J].Computer Applications and Software, 2018,35 (2): 200-205. (in Chinese)
|
[12] |
Lou Ying-dan, Xu Jing-lin, Huang Li-xia, et al. Speech recognition based on MLLR and MAP under distant noise reverberation environment [J]. Computer Engineering and Applications, 2020,56(10):122-126.(in Chinese)
|
[13] |
Liu B S,Chen X M,Han Y H,et al.Accelerating DNN-based 3D point cloud processing for mobile computing[J].Science China(Information Sciences),2019,62(11):40-50.
|
[14] |
Yu Dong,Deng Li.Parsing deep learning:Speech recognition practice [M]. Beijing:Electronic Industry Press,2016.(in Chinese)
|
[15] |
Zhang Wen-lin, Niu Tong, Zhang Lian-hai,et al.Rapid speaker adaptation based on maximum likelihood variable subspace [J].Journal of Electronics & Information Technology,2012,34(3):571-575.(in Chinese)
|
[16] |
Xiang J,Dong T,Pan R,et al.Clothing attribute recognition based on RCNN framework using L-Softmax loss[J].IEEE Access,2020,8:48299-48313.
|
[17] |
Joy N M,Baskar M K,Umesh S.DNNs for unsupervised extraction of pseudo speaker-normalized features without explicit adaptation data[J].Speech Communication,2017,92:64-76.
|
[18] |
Yang Jian-bin, Zhang Wei-qiang, Liu Jia. Investigation of normalization methods in speaker adaptation of deep neural network using i-vector [J].Journal of University of Chinese Academy of Sciences, 2017,34(5):633-639. (in Chinese)
|
[19] |
Dehak N,Kenny P,Dehak R,et al.Front-end factor analysis for speaker verification[J].IEEE Transactions on Audio,Speech,and Language Processing,2011,19(4):788-798.
|
[20] |
Yun S,Choi J Y,Shattuck-Hufnagel S.A landmark-based approach to transcribing systematic variation in the implementation of flapping in American English[J].Journal of the Acoustical Society of America,2017,141(5):3583-3583.
|
[21] |
Qu Dan,Yang Xu-kui,Zhang Wen-lin.Feature space eigenphone speaker adaptation [J].Acta of Automation Sinica,2015,41(7):1244-1252.(in Chinese)
|
|
附中文参考文献:
|
[4] |
刘建伟,丁熙浩,罗雄麟.多模态深度学习综述[J].计算机应用研究,2020,37(6):1601-1614.
|
[5] |
侯一民,周慧琼,王政一.深度学习在语音识别中的研究进展综述[J].计算机应用研究,2017,34(8):2241-2246.
|
[6] |
李云红,梁思程,贾凯莉,等.一种改进的DNN-HMM的语音识别方法[J].应用声学,2019,38(3):371-377.
|
[7] |
曹晶晶,许洁萍,邵聖淇.多噪声环境下的层级语音识别模型[J].计算机应用,2018,38(6):1790-1794.
|
[8] |
张宇,计哲,万辛,等.基于DNN的声学模型自适应实验[J].天津大学学报(自然科学与工程技术版),2015,48(9):765-770.
|
[10] |
屈丹,张文林.基于本征音子说话人子空间的说话人自适应算法[J].电子与信息学报,2015,37(6):1350-1356.
|
[11] |
金超,龚铖,李辉.语音识别中神经网络声学模型的说话人自适应研究[J].计算机应用与软件,2018,35(2):200-205.
|
[12] |
娄英丹,徐静林,黄丽霞,等.MLLR和MAP在远场噪声混响下的语音识别研究[J].计算机工程与应用,2020,56(10):122-126.
|
[14] |
俞栋,邓力.解析深度学习:语音识别实践[M].北京:电子工业出版社,2016.
|
[15] |
张文林,牛铜,张连海,等.基于最大似然可变子空间的快速说话人自适应方法[J].电子与信息学报,2012,34(3):571-575.
|
[18] |
杨建斌,张卫强,刘加.深度神经网络自适应中基于身份认证向量的归一化方法[J].中国科学院大学学报,2017,34(5):633-639.
|
[21] |
屈丹,杨绪魁,张文林.特征空间本征音说话人自适应[J].自动化学报,2015,41(7):1244-1252.
|