• Official journal of the China Computer Federation
  • Chinese Science and Technology Core Journal
  • Chinese Core Journal

J4 ›› 2011, Vol. 33 ›› Issue (5): 155-159.

• Paper •

模糊聚类与LBG级联的VQ算法

姜占才1,2,孙燕3,姚刚1   

  1. (1.青海师范大学物理系,青海 西宁 810008;2.青海师范大学藏文信息处理中心,青海 西宁 810008;
    3.青海民族大学计算机科学与技术学院,青海 西宁 810007)
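  • Received: 2010-04-28 Revised: 2010-08-03 Publication date: 2011-05-25 Release date: 2011-05-25
  • About the authors: JIANG Zhancai (born 1958), male, from Xining, Qinghai; M.S., professor; research interests: digital signal processing, speech processing, and low-bit-rate speech coding. SUN Yan (born 1973), female, from Qingdao, Shandong; M.S., associate professor; research interests: speech processing and computer applications. YAO Gang (born 1985), male, from Huili, Sichuan; master's student; research interests: speech processing and low-bit-rate speech coding.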

VQ Algorithm of Fuzzy Clustering and LBG Cascade

JIANG Zhancai1,2,SUN Yan3,YAO Gang1   

  1. (1. Department of Physics, Qinghai Normal University, Xining 810008;
    2. Tibetan Information Processing Center, Qinghai Normal University, Xining 810008;
    3. School of Computer Science and Technology, Qinghai Nationalities University, Xining 810007, China)
  • Received: 2010-04-28 Revised: 2010-08-03 Online: 2011-05-25 Published: 2011-05-25

Abstract (Chinese version):

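The LBG algorithm, when its initial codebook is chosen at random, tends to produce empty cells, to get trapped in local minima, and to need many iterations. To address these shortcomings, this paper draws on fuzzy clustering theory to introduce a cascaded fuzzy clustering and LBG algorithm for designing and training vector quantization codebooks: a codebook is first trained with a fuzzy clustering algorithm, the resulting codebook is taken as the initial codebook of the conventional LBG algorithm, and training then continues with conventional LBG. The principles and methods of the joint algorithm are described. The algorithm was used to train a codebook of log area ratios (LAR) of speech linear prediction coefficients and a codebook of speech subband voicing strengths. The training runs show that the fuzzy clustering stage reaches, or comes close to, the codebook design target, and that after the LBG stage the optimal design target is reached in every case. The trained codebooks were used in simulation experiments with several vocoders, yielding decoded speech that is highly intelligible and fairly natural and clear.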

Keywords: vector quantization, fuzzy clustering, LBG, codebook, training sample set

Abstract:

Because the traditional LBG algorithm tends to produce empty cells, fall into local minima, and require a large number of iterations, a joint algorithm of fuzzy clustering and LBG for training VQ codebooks is proposed on the basis of fuzzy clustering theory. The algorithm selects the initial codebook by fuzzy clustering, and the traditional LBG algorithm then searches for a better codebook starting from that initial codebook. This paper presents the principles and methods of the joint algorithm. Codebooks of the log area ratios (LAR) of linear prediction coefficients and of subband voicing strengths were trained with this algorithm. The training process indicates that the algorithm converges faster and generalizes well, provided that the training set is large enough and outliers have been removed. In the simulation experiments, the trained codebooks yielded synthesized speech of better quality.

Key words: vector quantization (VQ); fuzzy clustering; LBG; codebook; training sample set
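
The two-stage cascade described in the abstract can be illustrated with a minimal sketch. The abstract does not say which fuzzy clustering variant is used, so this sketch assumes standard fuzzy c-means with fuzzifier m = 2, a squared-error distortion measure, and a relative-improvement stopping rule; the function names (`fuzzy_c_means`, `lbg_refine`, `mean_distortion`) and all parameter defaults are hypothetical and are not taken from the paper.

```python
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def mean_distortion(samples, codebook):
    """Average squared error when each sample maps to its nearest codeword."""
    return sum(min(sq_dist(x, c) for c in codebook) for x in samples) / len(samples)


def fuzzy_c_means(samples, k, m=2.0, iters=50, seed=0):
    """Stage 1 (assumed variant): fuzzy c-means, returning k centroids."""
    rng = random.Random(seed)
    codebook = [list(v) for v in rng.sample(samples, k)]
    dim = len(samples[0])
    for _ in range(iters):
        num = [[0.0] * dim for _ in range(k)]
        den = [0.0] * k
        for x in samples:
            # Membership u_j is proportional to D_j^(-1/(m-1)), where D_j is
            # the squared distance to centroid j; the floor avoids 0 ** negative.
            d = [max(sq_dist(x, c), 1e-12) for c in codebook]
            w = [dj ** (-1.0 / (m - 1.0)) for dj in d]
            total = sum(w)
            for j in range(k):
                um = (w[j] / total) ** m
                den[j] += um
                for t in range(dim):
                    num[j][t] += um * x[t]
        # Every sample has nonzero membership in every cluster, so no
        # centroid is ever left without support at this stage.
        codebook = [[num[j][t] / den[j] for t in range(dim)] for j in range(k)]
    return codebook


def lbg_refine(samples, codebook, eps=1e-6, max_iters=100):
    """Stage 2: conventional LBG (generalized Lloyd) iterations.

    Returns the refined codebook and the distortion measured at the
    start of the last iteration.
    """
    codebook = [list(c) for c in codebook]
    prev = distortion = float("inf")
    for _ in range(max_iters):
        cells = [[] for _ in codebook]
        distortion = 0.0
        for x in samples:
            dists = [sq_dist(x, c) for c in codebook]
            j = min(range(len(codebook)), key=dists.__getitem__)
            cells[j].append(x)
            distortion += dists[j]
        distortion /= len(samples)
        for j, cell in enumerate(cells):
            if cell:  # an empty cell keeps its previous codeword
                codebook[j] = [sum(col) / len(cell) for col in zip(*cell)]
        if prev - distortion <= eps * max(distortion, 1e-12):
            break
        prev = distortion
    return codebook, distortion


if __name__ == "__main__":
    # Synthetic 2-D training set standing in for LAR or voicing vectors.
    rng = random.Random(1)
    centers = [(0.0, 0.0), (5.0, 5.0), (0.0, 5.0), (5.0, 0.0)]
    data = [[cx + rng.gauss(0, 0.3), cy + rng.gauss(0, 0.3)]
            for cx, cy in centers for _ in range(40)]
    init = fuzzy_c_means(data, k=4)
    final, dist = lbg_refine(data, init)
    print("distortion after stage 1:", mean_distortion(data, init))
    print("distortion after stage 2:", dist)
```

Because each LBG iteration can only keep or lower the distortion, the second stage never ends with a worse codebook than the one the fuzzy stage hands over. This is the property the cascade relies on.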