J4 ›› 2006, Vol. 28 ›› Issue (3): 1-4.

• Paper •

Framework and Key Implementation Points of a Large-Scale Search Engine Retrieval System

彭波   

  • Online: 2006-03-01 Published: 2010-05-20

Abstract:

As the Web continues to grow, search engines are becoming one of the most widely used applications on the Internet. Taking Tianwang as a case study, this paper analyzes the design and implementation of the retrieval system of a large-scale, general-purpose Chinese search engine. With retrieval efficiency and retrieval effectiveness as the two guiding concerns, we describe the integrated framework and the distributed architecture of the Tianwang retrieval system. We then analyze the implementation techniques used in index creation and index retrieval, which lead to a high-performance search engine retrieval system.

Key words: search engine; information retrieval; Tianwang