• Journal of the China Computer Federation
  • Chinese Science and Technology Core Journal
  • Chinese Core Journal

Computer Engineering & Science

A bit string contentaware data chunking algorithm

ZHOU Bin1, ZHU Rongbo1, ZHANG Ying2   

  (1. College of Computer Science, South-Central University for Nationalities, Wuhan 430074;
   2. School of Foreign Languages, Huazhong University of Science and Technology, Wuhan 430074, China)
  • Received: 2016-03-18 Revised: 2016-05-03 Online: 2016-10-25 Published: 2016-10-25

Abstract:

The content-defined chunking (CDC) algorithm incurs a large overhead in calculating digital signatures. To address this problem, we present a novel data chunking algorithm based on bit string content awareness. The proposed algorithm uses the bit feature information acquired from each failed match to eliminate as many unmatched positions as possible. Because each failed match yields the maximum jump length, the cost of intermediate calculations and comparisons is reduced. Experimental results show that the algorithm reduces the overhead of digital signature calculation during data chunking, lowers the CPU resources consumed in determining chunk boundaries, and improves the time performance of data chunking.
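
The jump idea can be illustrated with a minimal sketch. It is not the authors' algorithm, whose details the abstract does not give; it rests on an assumed boundary rule chosen purely for illustration: a chunk ends where the low-order bits of the last `window` bytes are all 1. The function name `chunk_boundaries` and the parameters `window`, `min_size`, and `max_size` are hypothetical. The window is scanned from right to left, so the first 0 bit found is the rightmost one, and every candidate window that would still contain that position can be skipped in a single jump.

```python
def chunk_boundaries(data: bytes, window: int = 8,
                     min_size: int = 16, max_size: int = 256) -> list[int]:
    """Return the exclusive end offset of each chunk in `data`.

    Assumed (illustrative) boundary rule: a chunk may end at offset `end`
    if the low-order bit of every byte in data[end - window:end] is 1.
    """
    if not (0 < window <= min_size <= max_size):
        raise ValueError("require 0 < window <= min_size <= max_size")

    boundaries = []
    start, n = 0, len(data)
    while start < n:
        limit = min(start + max_size, n)   # forced cut if no match is found
        end = start + min_size             # earliest permitted cut position
        cut = limit
        while end <= limit:
            lo = end - window
            p = end - 1
            # Scan the window right to left while the low bits are 1.
            while p >= lo and data[p] & 1:
                p -= 1
            if p < lo:                     # whole window matched
                cut = end
                break
            # data[p] is the rightmost 0 bit in the window. Every window
            # that still contains position p must fail, so jump directly
            # to the first window that starts after p.
            end = p + 1 + window
        boundaries.append(cut)
        start = cut
    return boundaries


if __name__ == "__main__":
    sample = bytes(range(256)) * 4
    print(chunk_boundaries(sample))
```

Under this assumed rule the jump never skips a valid boundary, so the result equals that of checking every position one by one, while bytes between the failed position and the next candidate window are never revisited.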

Key words: bit string content-aware, data chunking, digital signature