具有情感表现力的可视语音合成研究综述

J4 ›› 2015, Vol. 37 ›› Issue (4): 813-818.

具有情感表现力的可视语音合成研究综述

曹亮，赵晖

(新疆大学信息科学与工程学院,新疆乌鲁木齐 830046)

收稿日期:2014-03-03 修回日期:2014-05-21 出版日期:2015-04-25 发布日期:2015-04-25
基金资助:
国家自然科学基金资助项目（61261037）

A survey of emotional visual speech synthesis

CAO Liang,ZHAO Hui

(College of Information Science and Engineering,Xinjiang University,Urumqi 830046,China)

Received:2014-03-03 Revised:2014-05-21 Online:2015-04-25 Published:2015-04-25

摘要/Abstract

摘要：

总结和分析了近年来情感可视语音合成领域的一些关键研究成果和研究方法，并根据可视语音合成机制的不同，从基于图像的方法和基于模型的方法两个角度对情感可视语音合成技术进行了系统归类和阐述，分析对比了其各自的优缺点及性能差异。重点讨论了各文献合成的可视语音在真实性和情感表现力两个方面的实现机理和程度。最后指出了合成具有情感表现力的可视语音应该重点考虑的一些问题，为情感可视语音合成的进一步研究指明了方向。

关键词: 可视语音, 情感表现力, 基于模型的方法, 基于图像的方法

Abstract:

We summarize and analyze some of the key findings and research methods of the emotional visual speech in recent years.According to different synthesis mechanisms,we first systematically describe and classify the emotional visual speech synthesis technology from the aspects of both image-based methods and model-based methods.And then we compare and discuss their respective advantages,disadvantages,and performance. Moreover,we conduct an indepth discussion on the realization principles of authenticity and the degree of expressive emotion in visual speech synthesis in various literatures. Finally,we point out some other serious problems which should be taken into account,and the direction for further research on emotional visual speech.

Key words: visual speech;expressive emotion;model-based method;image-based method

曹亮，赵晖. 具有情感表现力的可视语音合成研究综述[J]. J4, 2015, 37(4): 813-818.

CAO Liang,ZHAO Hui. A survey of emotional visual speech synthesis [J]. J4, 2015, 37(4): 813-818.