I/O optimization in ErasureCode distributed
file system for massive data storage

J4 ›› 2013, Vol. 35 ›› Issue (5): 20-27.

• 论文 • Previous Articles Next Articles

I/O optimization in ErasureCode distributed
file system for massive data storage

YAN Lin1,2,XING Jing1,HUO Zhigang1,MA Jie1

(1.Institute of Computing Technology,Chinese Academy of Sciences,Beijing 100190;
2.University of Chinese Academy of Sciences,Beijing 100049,China)

Received:2012-05-16 Revised:2012-09-20 Online:2013-05-25 Published:2013-05-25

Abstract

Abstract:

As the rapid growing of massive data, the storage method of cluster file system is developing from replication to Erasure Code. The storage system based on Erasure Code can provide higher reliability with less storage overhead. However, in the procedure of storing original data and the coded data, storage based on Erasure Code faces more disk I/O conflicts and unbalanced load, which jeopardizes the throughput of the system. Specially, there is a tradeoff between data consistency and data caching in the storage system based on Erasure Code when writing back the parity. And there are limitations for the use of non-datacaching machanism and alldatacaching machanism in the cluster file system. For these two issues, the paper proposed an I/O optimization method, which includes the data layout machanism balancing the load and the parity consistency machanism. In the cluster file system ECFS we developed base on Erasure Code, the throughput of the system can be improved by 95.53% after exploiting the I/O optimization.

Key words: cluster file system;massive storage;erasurecode;data placement;parity caching;consistency

YAN Lin1,2,XING Jing1,HUO Zhigang1,MA Jie1. I/O optimization in ErasureCode distributed
file system for massive data storage[J]. J4, 2013, 35(5): 20-27.

I/O optimization in ErasureCode distributed
file system for massive data storage

PDF

Knowledge

Abstract

Cite this article

share this article

Related Articles 0

Recommended Articles 0

Metrics

Comments

I/O optimization in ErasureCode distributed file system for massive data storage

PDF

Knowledge

Abstract

Cite this article

share this article

Related Articles 0

Recommended Articles 0

Metrics

Comments

I/O optimization in ErasureCode distributed
file system for massive data storage