• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

J4 ›› 2016, Vol. 38 ›› Issue (4): 800-806.

• 论文 • Previous Articles     Next Articles

Chinese patent attributevalue
extraction technology and its application          

SUN Dongpu,ZHU Minghua,LIN Hongfei   

  1. (School of Computer Science and Technology,Dalian University of Technology,Dalian 116024,China)
  • Received:2015-01-27 Revised:2015-05-15 Online:2016-04-25 Published:2016-04-25

Abstract:

Patent information extraction is the foundation of patent analysis, and its  attributes and attribute value extraction are important to patent information extraction. However, few studies focus on synchronously extracting attributes and their values in Chinese patent information extraction. Using abstracts of the Chinese patents as corpus, we propose a conditional random fields (CRFs) method based on statistic learning knowledge. Firstly,regarding the attributes and attribute values as named entities,we obtain a CRFs model by training sets, and then use this model to extract attributes and attribute values from the corpus.Secondly, we employ association rules to match the attributes and their values. Experimental results show that the accuracy, recall and Fscore can reach 80.8%, 81.2% and 81.0% respectively.The comparison of the extraction results proves the practical value of the proposal.

Key words: attribute extraction;attribute value extraction;Chinese patent;conditional random fields (CRFs)