• Journal of the China Computer Federation
  • Chinese Science and Technology Core Journal
  • Chinese Core Journal

Computer Engineering & Science

Monaural voiced speech separation based
on computational auditory scene analysis

ZHANG Lina,ZHANG Erhua,JIANG Junliang   

  1. (School of Computer Science and Engineering,Nanjing University of Science & Technology,Nanjing 210094,China)
     
  • Received:2018-05-10 Revised:2018-11-08 Online:2019-07-25 Published:2019-07-25

Abstract:

To address voiced speech separation in monaural (single-channel) recordings, we propose an accurate pitch period estimation method. First, using the short-term stationarity of speech and the continuity of the pitch period as cues, we form a pitch spectrum from the cepstral peaks of the speech signal and automatically extract the pitch period track from it. Next, because each harmonic frequency is an integer multiple of the fundamental frequency, we pick out the spectrum of each harmonic. Finally, the voiced speech is reconstructed by the inverse Fourier transform. Experimental results show that the method extracts the pitch period track accurately and separates voiced signals effectively.
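
The three steps in the abstract can be illustrated on a single frame with a minimal NumPy sketch. This is not the authors' implementation: it picks one cepstral peak per frame and does not build the pitch spectrum or enforce continuity across frames. The function names, the 60-400 Hz search range, the 20 Hz harmonic half-width, and the synthetic test signal are all assumptions made for illustration.

```python
import numpy as np

def cepstral_pitch_period(frame, fs, f0_min=60.0, f0_max=400.0):
    """Return the pitch period (in samples) at the cepstral peak of one frame."""
    windowed = frame * np.hamming(len(frame))
    # Real cepstrum: inverse FFT of the log magnitude spectrum.
    log_mag = np.log(np.abs(np.fft.rfft(windowed)) + 1e-12)
    cepstrum = np.fft.irfft(log_mag, n=len(frame))
    # Search only quefrencies that correspond to plausible pitch values (assumed range).
    q_lo = int(fs / f0_max)
    q_hi = int(fs / f0_min)
    return q_lo + int(np.argmax(cepstrum[q_lo:q_hi + 1]))

def keep_harmonics(frame, fs, period, half_width_hz=20.0):
    """Keep FFT bins near integer multiples of f0, zero the rest, then invert."""
    f0 = fs / period
    spectrum = np.fft.rfft(frame)
    freqs = np.fft.rfftfreq(len(frame), d=1.0 / fs)
    # Nearest harmonic k * f0 (k >= 1) for every bin frequency.
    nearest = np.maximum(np.round(freqs / f0), 1.0) * f0
    mask = np.abs(freqs - nearest) <= half_width_hz
    return np.fft.irfft(spectrum * mask, n=len(frame))

# Synthetic example (assumed values): a 200 Hz harmonic series plus a 328 Hz interferer.
fs = 8000
t = np.arange(1000) / fs
voiced = sum(np.sin(2 * np.pi * 200.0 * k * t) for k in range(1, 20))
mixture = voiced + 0.8 * np.sin(2 * np.pi * 328.0 * t)

period = cepstral_pitch_period(mixture, fs)
separated = keep_harmonics(mixture, fs, period)
print("estimated f0 (Hz):", fs / period)
```

The tones in this example are chosen to fall on exact FFT bins so that the harmonic mask can remove the interferer cleanly. Real speech would need overlapping windowed frames, a voiced/unvoiced decision, and smoothing of the period track over time.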

Key words: computational auditory scene analysis, speech separation, pitch period track, voiced speech