Fault-tolerant mechanism combined with replication and error correcting code for cloud file systems

Dongri YANG, Ying WANG, Peng LIU

Journal of Tsinghua University(Science and Technology) ›› 2014, Vol. 54 ›› Issue (1) : 137-144.

PDF(2344 KB)
PDF(2344 KB)
Journal of Tsinghua University(Science and Technology) ›› 2014, Vol. 54 ›› Issue (1) : 137-144.
Orginal Article

Fault-tolerant mechanism combined with replication and error correcting code for cloud file systems

Author information +
History +

Abstract

Fault tolerant is important for the reliability of cloud storage file systems. This paper analyzes the reliability of typical cloud file systems with a central metadata server and proposes a fault-tolerant mechanism that combines replication schemes with error correcting codes for storage node reliability guarantee as well as a hot-standby scheme for the metadata server reliability guarantee. Experimental results demonstrate that the mechanism improves the reliability of current cloud storage file systems and at the same time improves the storage utilizations compared with replication schemes.

Key words

cloud storage / fault tolerant / hot stand-by / replication / erasure codes

Cite this article

Download Citations
Dongri YANG, Ying WANG, Peng LIU. Fault-tolerant mechanism combined with replication and error correcting code for cloud file systems[J]. Journal of Tsinghua University(Science and Technology). 2014, 54(1): 137-144

References

[1] Wang Y, Yang D R, Li P. CloStor: A cloud storage system for fast large-scale data I/O [M]//Advance in Computer Science and Its Applications. Springer Berlin Heidelberg, 2014: 1023-1030.
[2] Ghemawat S, Gobioff H, Leung S T. The Google file system [C]// Proc of the Symp on Operating Systems Principles (SOSP 2003). Bolton: ACM Press, 2003: 29-43.
[3] Shvachko K, Kuang H, Radia S, et al. The Hadoop distributed file system [C]// Proc of the IEEE 26th Symp on MSST. Lake Tahoe: IEEE, 2010: 1-10.
[4] Decandia G, Hastorun D, Jampani M, et al.Dynamo: Amazon's highly available key-value store [C]// Proc of the SOSP 2007. Stevenson: ACM Press, 2007: 205-220.
[5] Lakshman A, Malik P. Cassandra: A decentralized structured storage system [J]. ACM SIGOPS Operating Systems Review, 2010, 44(2): 35-40.
[6] Bhagwat D, Pollack K, Long D D E, et al. Providing high reliability in a minimum redundancy archival storage system [C]// Proc of the 14th IEEE International Symposium on MASCOTS. 2006: 413-421.
[7] Spillers N. Storage challenges in the medical industry [C]// The 4th Intelligent Storage Workshop. Digital Technology Center, University of Minnesota, 2006.
[8] Wicker S B, Bhargava V K. Reed-Solomon Codes and Their Applications [M]. Piscataway, NJ: IEEE Press, 1983.
[9] Luby M G, Mitzenmacher M, Shokrollahi M A, et al.Efficient erasure correcting codes[J]. IEEE Transactions on Information Theory, 2001, 47(2): 569-584.

Funding

 
PDF(2344 KB)

Accesses

Citation

Detail

Sections
Recommended

/