• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

J4 ›› 2007, Vol. 29 ›› Issue (9): 16-18.

• 论文 • 上一篇    下一篇

基于网页格局的内容分块算法

路松峰 王丹丹   

  • 出版日期:2007-09-01 发布日期:2010-06-02

  • Online:2007-09-01 Published:2010-06-02

摘要:

随着移动上网业务的日益发展,人们迫切希望能够通过手持终端设备访问丰富的Web内容。同时,由于手持终端设备存在着多方面的局限性,使得必须对所要访问的Web页面进行转换处理。本文提出了一种新的内容分块算法,能够智能化地通过分析内容关系对Web页面信息进行分块和抽取,使得手持终端设备用户能够快速、高效地访问Web内容。

关键词: wap 分块 语义关系 内容抽取 手持终端

Abstract:

people are not able to browse the entire Web pages freely and conveniently just like they do with browsers due to their handsets" lack of big screen  and power. So it is necessary to extract the summary of the Web page contents and covert it to something that handsets are able to present and process.   This paper proposes a novel algorithm based on VIPS, which can intelligently segment Web pages into blocks according to the semantic contents so that handsets can browse the contents of the Web pages efficiently.

Key words: wap, segmentation, semantic relationship, content extraction, handset