• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science

Previous Articles     Next Articles

Voice conversion based on optimizing GRNN by PSO

WANG Min,YANG Xiufeng,YAO Chenhong   

  1. (School of Information and Control Engineering,Xi’an University of Architecture and Technology,Xi’an 710055,China)
  • Received:2016-03-25 Revised:2016-11-02 Online:2018-04-25 Published:2018-04-25

Abstract:

The paper proposes a new voice conversion method based on using Particle Swarm Optimization (PSO)to optimize General Regression Neural Network (GRNN).Firstly, the method utilizes the characteristic parameters of the training speaker’s vocal tract and source excitation to train two GRNNs, and then obtains the structure parameters of GRNNs. Secondly, in order to reduce the adverse impact of artificial maninduced factors on conversion results,  PSO is used to optimize the parameters of the GRNN model. Finally, the pitch contour and the energy profile of prosodic features are linearly converted, thus making the converted voice contain more personalized feature information of source speaker.Experimental results show that,compared with the radial basis function neural network(RBF) and the GRNN based voice conversion methods,our method improves the naturalness and likelihood of the converted voices and evidently decreases the spectral distortion rate, so the converted voices are more closed to the target voices.

Key words: voice conversion, general regression neural network model, particle swarm optimization