• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

J4 ›› 2015, Vol. 37 ›› Issue (12): 2282-2293.

• 论文 • Previous Articles     Next Articles

Automatic recognition of the absent topics in Chinese
punctuation clauses based on maximum entropy model 

LU Dawei1,SONG Rou2   

  1. (1.Department of Chinese Language and Literature,Peking University,Beijing 100871;
    2.Institute of Language Information Processing,Beijing Language and Culture University,Beijing 100083,China)
  • Received:2015-09-01 Revised:2015-11-05 Online:2015-12-25 Published:2015-12-25

Abstract:

We focus on the task of the automatic recognition,which identify whether an absent topic of a punctuation clause is the subject or object of its previous sentence. We regard this task as the pointcut of the automatic recognition of absent topics in Chinese punctuation clauses. Several literal features and semantic features are summerized to achieve this task by combining the rules and the maximum entropy model. Experimental results show that Fscore of this recognition approach reaches 82% for the samples of some specific verbs. Experimental results analysis shows that verb features and semantic features play the most important role in the recognition process; neither rules nor statistics can be neglected, and refined knowledge has great influence on the performance of the recognition .

Key words: generalized topic structure;new branch topic;automatic recognition;maximum entropy model