• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

J4 ›› 2016, Vol. 38 ›› Issue (02): 386-394.

• 论文 • 上一篇    下一篇

一种基于情绪激励度的情绪词加权方法

王世泓,牛耘   

  1. (南京航空航天大学计算机科学与技术学院,江苏 南京 210016)
  • 收稿日期:2015-02-28 修回日期:2015-05-01 出版日期:2016-02-25 发布日期:2016-02-25
  • 基金资助:

    国家自然科学基金(61202132,61170043);国家973计划(2013CD744904)

A weighting method of emotion words
based on the level of arousal       

WANG Shihong,NIU Yun   

  1. (School of Computer Science and Technology,Nanjing University of Aeronautics and Astronautics,Nanjing 210016,China)
  • Received:2015-02-28 Revised:2015-05-01 Online:2016-02-25 Published:2016-02-25

摘要:

在不同的上下文中,情绪词对情绪的激励程度会发生变化。现有情绪词典中大多数只标注了情绪词的情绪类别而未涉及情绪词的激励度。在极少数标注情绪强度的词典中,所标注的强度未考虑上下文的影响。提出一种根据上下文形成的情境评估情绪词对情绪的激励程度并据此对情绪词加权的方法。通过比较情绪词的共现模式与自身情绪类的分布模式计算情绪词的激励程度。然后根据激励程度计算情绪词的情绪权重并将其用于微博情绪识别。实验结果表明,与现有词典中的情绪强度相比,本文方法计算的情绪权重更准确地描述了情绪词在语料中表达的情绪,有效地提高了情绪分析的精度。并且本文方法还能够有效综合多个词典的优势,进一步提高微博情绪分析的准确率。

关键词: 情绪强度, 情绪词典, 语料上下文, 情绪激励度, 情绪权重

Abstract:

The level of arousal for an emotion word may vary in different contexts. However, this information is ignored in most emotion lexicons. We propose a weighting scheme of emotional words based on the level of arousal in various contexts. The level of arousal is evaluated by analyzing the cooccurrence patterns of emotion words as well as the distribution patterns of emotions. The weights of emotion words are calculated according to the level of arousal, and the results are used to weight emotion words in emotion analysis of microblogs. Experimental results on two data sets show that the proposed method outperforms current emotion lexicons. Moreover, the proposed approach can take advantages of multiple lexicons and further improve the accuracy of the system.

Key words: emotion intensity;emotion lexicon;corpus context;level of arousal;emotion weight