• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

J4 ›› 2010, Vol. 32 ›› Issue (5): 140-142.doi: 10.3969/j.issn.1007130X.2010.

• 论文 • 上一篇    下一篇

改进的功率谱二次处理基音检测法

朱建伟1,孙水发1,2,但志平1,2,雷帮军1,2   

  1. (1.三峡大学电气与新能源学院,湖北 宜昌 443002;2.三峡大学智能视觉与图像信息研究所,湖北 宜昌 443002)
  • 收稿日期:2009-09-13 修回日期:2009-12-10 出版日期:2010-04-28 发布日期:2010-05-11
  • 通讯作者: 朱建伟 E-mail:zhujianwei1015@163.com
  • 作者简介:朱建伟(1985),男,湖北老河口人,硕士生,研究方向为语音信号处理;孙水发,博士,副教授,研究方向为信息隐藏和多媒体信息处理。
  • 基金资助:
    湖北省教育厅重大项目(Z20081301);湖北省自然科学基金资助项目(2008CDB346);宜昌市科学技术研究与开发项目(A0930231,A0930232)

Improved Power Spectrum Reprocessing Pitch Detection Method

ZHU Jianwei1,SUN Shuifa1,2,DAN Zhiping1,2,LEI Bangjun1,2   

  1. (1.School of Electrical Engineering and New Energy,China Three Gorges University,Yichang 443002; 2.Institute of Intelligent Vision and Image Information,China Three Gorges University,Yichang 443002,China)
  • Received:2009-09-13 Revised:2009-12-10 Online:2010-04-28 Published:2010-05-11

摘要: 作为语音信号处理中的一项关键技术,基音检测一直是研究热点。本文分析了功率谱二次处理基音检测方法的不足:对于过渡语音,易产生半频或倍频误判;噪声干扰下,检测结果易失真;清、浊音的判断方法复杂。针对这些不足,本文提出一系列改进方法:时域非线性处理,频域加窗滤波,简化清、浊音判断。MATLAB仿真实验结果表明,无论是高信噪比还是低信噪比语音,改进的二次谱法较AMDF法和二次谱法更能清晰、准确地检测出基音轨迹。

关键词: 基音检测, 倒谱法, 功率谱二次处理, 非线性处理

Abstract: As a key technique, the pitch detection has been a hot spot in the field of speech processing. The following three disadvantages of power spectrum reprocessing (PSR) method for pitch detection are found: half pitch error and double pitch error with transition sound; easy distortion in noise speech; the complex method of judging voiceless and voiced speech. A series of improvement methods are proposed: nonlinear processing at timedomain; windowed filtering at frequencydomain; simplification method of judging voiceless and voiced speech. The experimental results based on MATLAT show that the improved method detects pitch trajectory more clearly and accurately than the AMDF method and the PSR method.

Key words: pitch detection;cepstrum method;power spectrum reprocessing;nonlinear processing

中图分类号: