• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

J4 ›› 2014, Vol. 36 ›› Issue (06): 1101-1107.

• 论文 • 上一篇    下一篇

高效的连续不确定XML数据Top-k查询算法

张晓琳,郑春红,刘立新,吕庆   

  1. (内蒙古科技大学信息工程学院,内蒙古 包头 014010)
  • 收稿日期:2012-11-13 修回日期:2013-04-10 出版日期:2014-06-25 发布日期:2014-06-25
  • 基金资助:

    国家自然科学基金资助项目(61163015);内蒙古自然科学基金资助项目(20080404Zd21)

An efficient algorithm of top-k
inquires over continuous uncertain XML          

ZHANG Xiaolin,ZHENG Chunhong,LIU Lixin,L Qing   

  1. (School of Information Engineering,Inner Mongolia University of Science and Technology,Baotou 014010,China)
  • Received:2012-11-13 Revised:2013-04-10 Online:2014-06-25 Published:2014-06-25

摘要:

目前,不确定XML数据的top-k查询算法中都没有处理连续不确定数据,本文提出SPCProTJFast算法,该算法改进了传统的归并算法,并结合连续不确定数据的过滤方法,实现了连续不确定XML的Top-k查询。为了避免概率下限值过小对过滤效果的影响,又提出HPCProTJFast算法,该算法推迟了对连续节点的处理,只有在获得满足概率条件的整枝路径时才对连续节点进行访问。实验表明,在执行时间以及过滤效率上,同直接处理连续不确定数据的ProTJFast算法相比,这两种算法都要更高效,并且HPCProTJFast算法的效率更高。

关键词: 连续不确定, XML, 归并, top-k

Abstract:

Currently, the top-k query algorithms about the uncertain XML data can not deal with continuous uncertain data. The SPCProTJFast  algorithm is proposed, which improves the traditional merging algorithm, combines with continuous uncertain data filtering methods, implements the top-k query algorithm over continuous uncertain XML data. In order to avoid the impact of too small probability limit on filtering effect, the HPCProTJFast  algorithm is proposed, which delays the handling of continuous types of nodes and visits the continuous nodes only when the entire twig that meets the probability condition are acquired. Experimental results show that, in terms of the execution time and the filtration efficiency, these two algorithms are more efficient than the ProTJFast algorithm that deals with continuous uncertain data directly, and the HPCProTJFast algorithm is the most efficient.

Key words: continuous uncertain;XML;merge;top-k