Automatic patent query expansion based on word embedding

Computer Engineering & Science

LIU Meng-lan1,2, LIU Bin1,2, PENG Zhi-yong1,2

(1. State Key Laboratory of Software Engineering, Wuhan University, Wuhan 430072;
2. School of Computer, Wuhan University, Wuhan 430072, China)

Received: 2017-07-10 Revised: 2017-09-15 Online: 2017-12-25 Published: 2017-12-25

Abstract

Patent retrieval differs substantially from general information retrieval. A patent document comprises claims (the statement of rights), an abstract, and the full text, so retrieval algorithms designed for ordinary text cannot simply be applied to it. Patent retrieval typically suffers from low recall, for two reasons. First, patent texts use highly specialized terminology and complex phrasing, which makes it hard to capture the search intent behind a user's query and leads to unsatisfactory results. Second, inventors deliberately coin distinctive terms when drafting patents so that their documents are harder to retrieve. Many retrieval algorithms have been designed to improve recall, but open problems remain and their effectiveness still needs to improve. We propose an automatic patent query expansion model based on word embedding. Using word embeddings, we construct a keyword network for the patent domain and then apply a dense subgraph discovery algorithm to that network to find expansion terms, which improves the effectiveness of the selected terms. Extensive experiments on the CLEF-IP 2012 dataset show that the proposed algorithm ensures the flexibility and effectiveness of the expansion terms and improves the recall of patent retrieval.

Key words: patent retrieval, query expansion, word embedding, deep learning
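
The pipeline summarized in the abstract (word embeddings, a keyword network, dense subgraph discovery) can be illustrated with a minimal sketch. The abstract does not specify how edges are defined or which dense subgraph algorithm is used, so everything here is an assumption made for illustration: the cosine-similarity threshold, the one-hop candidate neighborhood, the greedy peeling heuristic, the toy 2-D vectors, and all function names. It is not the authors' implementation.

```python
import math

def cosine(u, v):
    """Cosine similarity between two vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def build_keyword_graph(embeddings, threshold):
    """Assumed edge rule: link two keywords when their cosine similarity >= threshold."""
    terms = list(embeddings)
    graph = {t: set() for t in terms}
    for i, a in enumerate(terms):
        for b in terms[i + 1:]:
            if cosine(embeddings[a], embeddings[b]) >= threshold:
                graph[a].add(b)
                graph[b].add(a)
    return graph

def densest_subgraph(graph):
    """Greedy peeling heuristic: repeatedly drop a minimum-degree node and
    keep the intermediate subgraph with the highest edges/nodes ratio."""
    adj = {n: set(nb) for n, nb in graph.items()}
    edges = sum(len(nb) for nb in adj.values()) // 2
    best_nodes = set(adj)
    best_density = edges / len(adj) if adj else 0.0
    while len(adj) > 1:
        # Ties on degree are broken by name so the result is deterministic.
        victim = min(adj, key=lambda n: (len(adj[n]), n))
        edges -= len(adj[victim])
        for nb in adj.pop(victim):
            adj[nb].discard(victim)
        density = edges / len(adj)
        if density > best_density:
            best_nodes, best_density = set(adj), density
    return best_nodes

def expand_query(query_terms, embeddings, threshold=0.94):
    """Return expansion terms: members of the dense subgraph found around
    the query terms, excluding the query terms themselves."""
    graph = build_keyword_graph(embeddings, threshold)
    seeds = {t for t in query_terms if t in graph}
    candidates = set(seeds)
    for t in seeds:
        candidates |= graph[t]  # one-hop neighborhood (an assumption)
    induced = {n: graph[n] & candidates for n in candidates}
    return sorted(densest_subgraph(induced) - seeds)

def toy_vector(degrees):
    """Hypothetical 2-D unit vector standing in for a trained embedding."""
    r = math.radians(degrees)
    return (math.cos(r), math.sin(r))

# Toy data only; real embeddings would be trained on a patent corpus.
TOY_EMBEDDINGS = {
    "engine": toy_vector(0), "motor": toy_vector(6), "turbine": toy_vector(12),
    "compressor": toy_vector(18), "rotor": toy_vector(36),
    "banana": toy_vector(90), "fruit": toy_vector(96),
}

print(expand_query(["compressor"], TOY_EMBEDDINGS))  # ['engine', 'motor', 'turbine']
```

In this toy network "rotor" is linked only to "compressor", so peeling removes it and the expansion keeps the tightly connected terms. Requiring density rather than mere adjacency to the query is one plausible way to filter out loosely related candidates.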

LIU Meng-lan, LIU Bin, PENG Zhi-yong. Automatic patent query expansion based on word embedding [J]. Computer Engineering & Science.
