• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science

Previous Articles     Next Articles

Feature selection for product name
recognition in informal texts

YANG Mei-ni1,HE Tao2,SHEN Jing1,ZHANG Jian-jun1   

  1. (1.College of Science,Naval University of Engineering,Wuhan 430033;
    2.Wuhan Library of Chinese Academy of Sciences,Wuhan 430071,China)
  • Received:2015-09-06 Revised:2015-11-10 Online:2016-10-25 Published:2016-10-25

Abstract:

Most previous studies on named entity recognition (NER) focus on common names such as persons,organizations,and locations in formal texts.With the development of e-commerce and online advertising,how to recognize product names which are special named entities in informal users context becomes more and more important.We design a maximum entropy model to recognize product names from forum posts and explore the impact of various features on the performance.These features include not only traditional features used for NER,but also distributed word representations which are novel ones obtained from the new area of machine learning.We compare the results of the experiments using different feature combinations as inputs.Experiments on the CPROD01 dataset show that the Brown cluster features can improve the accuracy of the product name recognition system.
 

Key words: product name, informal text, maximum entropy model, distributed representation of words