• Journal of the China Computer Federation
  • Chinese Science and Technology Core Journal
  • Chinese Core Journal

J4 ›› 2015, Vol. 37 ›› Issue (12): 2306-2311.

• Articles •

Construction of a Chinese Event Relevance Corpus and a Recognition Method

HUANG Yilong (黄一龙), LI Peifeng (李培峰), ZHU Qiaoming (朱巧明)

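  1. (1. School of Computer Science and Technology, Soochow University, Suzhou, Jiangsu 215006; 2. Jiangsu Provincial Key Laboratory of Computer Information Processing Technology, Suzhou, Jiangsu 215006, China)
  • Received: 2015-09-03  Revised: 2015-10-26  Online: 2015-12-25  Published: 2015-12-25
  • Supported by:

    National Natural Science Foundation of China (61472265); Key Program of the National Natural Science Foundation of China (61331011); Jiangsu Province Prospective Joint Research Project (BY201405908); partially supported by the Collaborative Innovation Center of Novel Software Technology and Industrialization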

Construction and its recognition of Chinese relevant event

HUANG Yilong,LI Peifeng,ZHU Qiaoming   

  1. (1. School of Computer Science and Technology, Soochow University, Suzhou 215006; 2. Province Key Lab of Computer Information Processing Technology of Jiangsu, Suzhou 215006, China)
  • Received: 2015-09-03  Revised: 2015-10-26  Online: 2015-12-25  Published: 2015-12-25

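Abstract:

Events usually revolve around a topic and are related to one another. In the era of big data, filtering the events related to a given topic out of massive amounts of information benefits natural language processing tasks such as information extraction, text summarization, and text generation. This paper first proposes a method for annotating relevant events and uses it to annotate a Chinese event relevance corpus. It then presents a preliminary method for recognizing relevant events based on multiple features. Experiments on the annotated corpus show that the method improves the F1 score over the baseline system by 4.08%.

Keywords: relevant event corpus, annotation, relevance, event relation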

Abstract:

Many relevant events typically surround a given topic. In the era of big data, extracting the events that are relevant to a specific topic is helpful for many natural language processing applications, such as information extraction, text summarization, and text generation. We propose a method for annotating relevant events and use it to construct a Chinese relevant event corpus. We then put forward a relevant event recognition approach based on various distance and semantic features. Experimental results on the annotated corpus show that the proposed approach outperforms the baseline by 4.08% in F1-measure.

Key words: relevant event corpus; annotation; relevance; event relation