• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

J4 ›› 2008, Vol. 30 ›› Issue (5): 59-64.

• 论文 • 上一篇    下一篇

更新数据流上的连续Skyline计算

田李 李爱平 邹鹏 贾焰   

  • 出版日期:2008-05-01 发布日期:2010-05-19

  • Online:2008-05-01 Published:2010-05-19

摘要:

本文考虑"更新数据流"场景下的连续Skyline计算问题。在该环境下,数据不再满足"先进先出"特性,使得传统基于滑动窗口数据流上的连续Skyline计算方法不再适用。在对问题进行了形式化描述后,本文提出了基本算法BUSM,在分析其不足的基础上提出了一种网格索引数据结构,基于该结构提出了GUSM算法。该算法利用了更新数据流中删除和添加操作成对同时出现的特性,以网格为单位表示影响区域并进行快速排除预处理。理论分析和实验结果证明了上述方法在更新数据流上连续计算Skyline的有效性。

关键词: Skyline 更新数据流 网格索引数据结构

Abstract:

In this paper, we address the continuous skyline computation problem, and consider a new scenario named update stream where the First-In-First-Out rul  e (which is the basic and important feature of traditional sliding window models) does not hold, which leads to the existing algorithms' inapplicabil  lity. The problem is formally described; a Basic Updatestream Skyline Monitoring algorithm (BUSM) is raised and analyzed; a novel grid-indexed data st  tructure is presented, and then a Grid-based Update-stream Skyline Monitoring algorithm (GUSM) is proposed, which makes use of the characteristics whe ere the deletion and addition operations appear simultaneously in update streams, and represents the influence region by grids for the previous eliminat ion. Analytical and experimental results show that the proposed approaches perform well on the continuous skyline computation over update data streams.

Key words: Skyline, update data stream, grid-indexed data structure