Computer Engineering & Science ›› 2022, Vol. 44 ›› Issue (05): 901-909.
• Artificial Intelligence and Data Mining • Previous Articles Next Articles
CHEN Jian-peng,CHEN Jian,SHE Xiang-rong,SHUI Xin-ying,CHEN Gang
Received:
Revised:
Accepted:
Online:
Published:
Abstract: The standardization of Chinese geographic addresses plays a crucial role in the current construction of smart cities. The traditional geographic address standardization technology usually uses the methods of similarity calculation or rule base matching based on the text character level, and the processing effect of complex, special or redundant addresses is poor. This paper proposes an address match- ing algorithm that combines attention mechanism and multi-level representation by converting the address standardization task into a matching degree calculation task for similar addresses. Firstly, according to the special grammatical structure of the address text, a standard address tree is constructed by using the Trie grammatical tree. Secondly, based on the attention mechanism, the Bi-LSTM network and the CNN network are used to generate multi-level semantic representations of address pairs. Finally, the similarity is calculated by Manhattan distance. On the self-built dataset, the proposed SGAM (Symmetrical Geographic Address Matching) model improves the matching accuracy (91.22%) by 4%~10% in comparison to TextRCNN, FastText, attention-based convolutional neural network (ABCNN) and other models, proving that the SGAM model has better performance on the address matching task.
Key words: geographic address, text similarity calculation, attention mechanism, hybrid neural network, smart city
CHEN Jian-peng, CHEN Jian, SHE Xiang-rong, SHUI Xin-ying, CHEN Gang. An address matching algorithm based on hybrid neural network model and attention mechanism[J]. Computer Engineering & Science, 2022, 44(05): 901-909.
0 / / Recommend
Add to citation manager EndNote|Ris|BibTeX
URL: http://joces.nudt.edu.cn/EN/
http://joces.nudt.edu.cn/EN/Y2022/V44/I05/901