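• Journal of the China Computer Federation
• China Science and Technology Core Journal
• Chinese Core Journal

Computer Engineering & Science (计算机工程与科学)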


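Gender recognition of Chinese micro-blog users based on emotion features

LIU Bao-qin, NIU Yun

  1. (School of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing 210016, Jiangsu, China)
  • Received: 2015-07-13 Revised: 2015-09-15 Online: 2016-09-25 Published: 2016-09-25
  • Supported by:

    National Natural Science Foundation of China (61202132)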

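Abstract:

With the rapid growth of the Internet, micro-blogging has attracted more and more users, and research on the gender of micro-blog users has become an active topic in academia. Gender recognition for users of English-language micro-blogs has already been studied, but little work addresses users of Chinese micro-blogs. Starting from the differences in how men and women express emotion, we propose a gender recognition method for Chinese micro-blog users based on emotion features. The emotion features considered include emotional-word features and emotion-related linguistic style features. Experimental results show that using emotion features improves the accuracy of user gender recognition.

Key words: gender recognition, Chinese micro-blogs, emotion style features, emotional words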

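
The abstract names two feature groups, emotional-word features and emotion-related linguistic style features, without giving their exact definitions. The following is only a minimal sketch of how such a feature vector might be built for one user. The lexicon, the emotion classes, the style markers, and every function name are hypothetical placeholders, not the authors' resources or method. The sketch uses plain substring matching instead of Chinese word segmentation.

```python
# Hypothetical toy lexicon mapping emotion words to emotion classes.
# A real system would presumably use a much larger emotion lexicon.
EMOTION_LEXICON = {
    "开心": "joy",
    "喜欢": "joy",
    "难过": "sadness",
    "生气": "anger",
    "害怕": "fear",
}
EMOTION_CLASSES = ("joy", "sadness", "anger", "fear")

# Hypothetical style markers that may accompany emotional expression:
# ASCII and full-width exclamation marks, tilde, laughter, a modal particle.
STYLE_MARKERS = ("!", "\uff01", "~", "哈哈", "啊")


def emotion_word_features(text):
    """Count lexicon hits per emotion class using substring matching."""
    counts = {cls: 0 for cls in EMOTION_CLASSES}
    for word, cls in EMOTION_LEXICON.items():
        counts[cls] += text.count(word)
    return [counts[cls] for cls in EMOTION_CLASSES]


def style_features(text):
    """Count non-overlapping occurrences of each style marker."""
    return [text.count(marker) for marker in STYLE_MARKERS]


def extract_features(posts):
    """Sum both feature groups over a user's posts, then average per post."""
    n = max(len(posts), 1)  # avoid division by zero for users with no posts
    totals = [0] * (len(EMOTION_CLASSES) + len(STYLE_MARKERS))
    for post in posts:
        vec = emotion_word_features(post) + style_features(post)
        totals = [t + v for t, v in zip(totals, vec)]
    return [t / n for t in totals]


if __name__ == "__main__":
    user_posts = ["今天好开心!哈哈", "有点难过啊"]
    print(extract_features(user_posts))
```

A vector like this could be passed, alone or alongside other text features, to any standard supervised classifier trained on users with known gender labels. The abstract does not say which classifier the authors used.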