• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊
论文

A Topic Crawler AlgorithmBased on Semantic Analysis

Expand
  • (School of Computer Science,Beijing University of Technology,Beijing 100124,China)

Received date: 2010-03-12

  Revised date: 2009-06-17

  Online published: 2010-09-02

Abstract

Massive web and its rapid growth make it difficult for generalpurpose search engines to provide satisfactory results for the theme or areaoriented queries. This paper studies the subject of gathering information relevant to the subject, to significantly reduce the amount of web pages dealing. By assessing the degree of Web pages, it gives priority to the crawling pages related to a higher degree. Using a subspacebased semantic analysis technique, combined with the Bayesian mechanism and support vector machine, we design and implement an efficient topic crawler. Experiments show that our algorithm has good accuracy and efficiency.

Cite this article

JIANG Zongli,TIAN Xiaoyan,ZHAO Xu . A Topic Crawler AlgorithmBased on Semantic Analysis[J]. Computer Engineering & Science, 2010 , 32(9) : 145 -147 . DOI: topic crawler;subspace;semanti

Outlines

/