• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science

Previous Articles     Next Articles

A Chinese address recognition method
 based on address semantics
 

LI Xiaolin1,ZHANG Yi1,LI Lin2   

  1. (1.Hubei Key Laboratory of Intelligent Robot,Wuhan Institute of Technology,Wuhan 430205;
    2.School of Resource and Environmental Science,Wuhan University,Wuhan 430079,China)
     
  • Received:2017-12-25 Revised:2018-08-15 Online:2019-03-25 Published:2019-03-25

Abstract:

There are a large number of Chinese address text in the Internet that contains rich spatial location information. In order to obtain the address location information in the text more effectively, we propose a Chinese address location information recognition method based on address semantics. According to the statistics of word frequency of the training corpus, we obtain a set of address feature words and word transition probability. Then, we construct a feature word transition probability matrix. Finally, combining with the string maximum joint probability algorithm, we put forward an address recognition method which does not depend on address dictionary and tagging of the part of speech. Experimental results show that the exact match rate of the method is 76.85% for ambiguous Chinese addresses with prominent feature words, and the recognition accuracy is 93.11%. Compared with the mechanical matching algorithm and the methods for constructing the transition probability matrix based on experience, experimental results verify the feasibility and effectiveness of the proposed method.

Key words: address semantics, feature character word, transfer probability, without dictionary