[1]Yamagishi J, Kobayashi T, Nakano Y, et al. Analysis of speaker adaptation algorithms for HMM-based speech synthesis and a constrained SMAPLR adaptation algorithm[J].IEEE Transactions on Audio,Speech,and Language Processing,2009,17(1):66-83.
[2]Erro D,Moreno A,Bonafonte A.Voice conversion based on weighted frequency warping[J].IEEE Transactions on Audio, Speech,and Language Processing,2010,18(5):922-931.
[3]Desai S,Black A W,Yegnanarayana B,et a1.Spectral mapping using artificial neural networks for voice conversion[J].IEEE Transactions on Audio,Speech, and Language Processing,2010,18(5):954-964.
[4]Saruwatari T H, Shikano K. Voice conversion algorithm based on Gaussian mixture model with dynamic frequency warping of STRAIGHT spectrum[C]∥Proc of IEEE International Conference on Acoust,Speech,Signal Processing, 2001:841-844.
[5]Mohamed A,Dahl G E,Hinton G.Acoustic modeling using deep belief networks[J].IEEE Transactions on Audio,Speech,and Language Processing,2012,20 (1):14 -22.
[6]Xu You-liang,Zhang Lian-hai,Zhang Wen-lin,et al.A speaking rate adaptation technique and phonological attribute posterior for phone recognition[J].Signal Processing,2012,28(2):295-300.(in Chinese)
[7]Xu Ning, Yang Zhen. High quality voice morphing system[J].Journal of Applied Science-Electronics and Information Engineering,2008,26(4):378-383.(in Chinese)
[8]Lei Yun, Hansen J H L. Dialect classification via text-independent training and testing for Arabic,Spanish,and Chinese[J].IEEE Transations on Audio,Speech,and Language Processing,2011,19(1):85-96.
[9]Huang Chen-chen,Gong Wei,Fu Wen-long,et al.Research of speech emotion recognition based on DBNs[J].Journal of Computer Research and Development,2014,51(Suppl):75-80.(in Chinese)
[10]Ma Yong,Bao Chang-chun,Xia Bing-yin.Speaker segmention based on discriminative deep belief networks[J].Journal of Tsinghua University(Sci&Tech),2013,53(6):804-807.(in Chinese)
附中文参考文献:
[6]许友亮,张连海,张文林,等.基于语速调整和音位属性后验概率的音素识别[J].信号处理,2012,28(2):295-300.
[7]徐宁,杨震.高合成质量的语音转换系统[J].应用科学学报,2008,26(4):378-383.
[9]黄晨晨,巩微,伏文龙,等.基于深度信念网络的语音情感识别的研宄[J].计算机研宄与发展, 2014,51(Suppl):75-80.
[10]马勇,鲍长春,夏丙寅.基于辨别性深度信念网络的说话人分割[J].清华大学学报(自然科学版),2013,53(6):804-807. |