J4 ›› 2015, Vol. 37 ›› Issue (02): 252-256.
• Articles •
ZHOU Jingcai¹, HU Huaping¹,², YUE Hong¹
Abstract:
As informatization continues to advance, high-performance, full-featured text search systems that can quickly locate matching records in massive data sets have become a research hotspot. Based on an analysis of the fundamentals of full-text retrieval techniques and the structure of the Lucene system, we present an MVC-pattern full-text retrieval model and develop a retrieval system based on the SSH framework and the Lucene search engine. The system makes three contributions. First, it extends the supported file formats, adding PDF, HTML, and RTF to TXT and MS Office documents in the search library. Second, it improves the efficiency and accuracy of the Chinese word segmenter. Third, it enhances human-machine interaction and provides a result display similar to that of Baidu and Google, with the search keywords highlighted. Practical application of the system shows that it creates indexes efficiently and speeds up search while returning more relevant results.
Key words: Lucene; document parsing; full-text retrieval; search engine
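The indexing, search, and keyword highlighting steps summarized in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation and does not use Lucene's API: it is a plain Python stand-in, and the names `tokenize`, `build_index`, `search`, and `highlight` are hypothetical. The regex tokenizer is an assumption that only suits space-delimited text; Chinese text would need a real word segmenter, which the paper addresses separately.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase and split into word tokens; a stand-in for a real analyzer."""
    return re.findall(r"\w+", text.lower())


def build_index(docs):
    """Build an inverted index: term -> set of ids of documents containing it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in tokenize(text):
            index[term].add(doc_id)
    return index


def search(index, query):
    """Return ids of documents that contain every query term (AND semantics)."""
    terms = tokenize(query)
    if not terms:
        return set()
    result = set(index.get(terms[0], set()))
    for term in terms[1:]:
        result &= index.get(term, set())
    return result


def highlight(text, query, pre="<b>", post="</b>"):
    """Wrap whole-word, case-insensitive matches of the query terms in markers."""
    terms = sorted(set(tokenize(query)), key=len, reverse=True)
    if not terms:
        return text
    pattern = re.compile(
        r"\b(" + "|".join(map(re.escape, terms)) + r")\b", re.IGNORECASE
    )
    return pattern.sub(lambda m: pre + m.group(0) + post, text)
```

With `docs = {1: "Lucene builds an inverted index.", 2: "Full-text retrieval with Lucene and SSH."}`, calling `search(build_index(docs), "Lucene index")` returns `{1}`, and `highlight(docs[1], "lucene index")` wraps "Lucene" and "index" in `<b>` tags. A production system would delegate these steps to the search engine's own analyzers and highlighter.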
ZHOU Jingcai, HU Huaping, YUE Hong. Design and implementation of Lucene-based full-text retrieval system [J]. J4, 2015, 37(02): 252-256.
URL: http://joces.nudt.edu.cn/EN/
http://joces.nudt.edu.cn/EN/Y2015/V37/I02/252