J4 ›› 2015, Vol. 37 ›› Issue (02): 231-237.
• 论文 • Previous Articles Next Articles
YU Juan,LIU Qiang
Received:
Revised:
Online:
Published:
Abstract:
With the exponential growth of network information resources and the growing personalized demands of customers, topicfocused crawler emerges as the times require. Topicfocused crawlers are programs designed to download web pages which are relevant to specific topics. Using information gathered at running time, topicfocused crawlers explore the webs which follow promissory hyperlinks, and fetch only pages which appear to be relevant. The searching engine and corpus building based on topicfocused crawling have been widely used. We first define the goals and operating principles of focused crawling, comprehensively analyze the recent advances at home and abroad, and then compare the crawling strategies of various topicfocused crawlers as well as the advantages and disadvantages of related algorithms. Finally, we point out the future direction of topicfocused crawling.
Key words: web crawler;focused-crawler;searching engine
YU Juan,LIU Qiang. Survey on topic-focused crawlers [J]. J4, 2015, 37(02): 231-237.
0 / / Recommend
Add to citation manager EndNote|Ris|BibTeX
URL: http://joces.nudt.edu.cn/EN/
http://joces.nudt.edu.cn/EN/Y2015/V37/I02/231