DocumentCode :
659511
Title :
Minimum storage BASIC codes: A system perspective
Author :
Xianxia Huang ; Hui Li ; Tai Zhou ; Yumeng Zhang ; Han Guo ; Hanxu Hou ; Huayu Zhang ; Kai Lei
Author_Institution :
Shenzhen Grad. Sch., Peking Univ., Shenzhen, China
fYear :
2013
fDate :
6-9 Oct. 2013
Firstpage :
39
Lastpage :
43
Abstract :
The explosion of big data stored in distributed file systems calls for more efficient storage paradigms. While replication is widely used to ensure data availability, erasure codes provide a much better tradeoff between storage and availability. Reed-Solomon (RS) codes are the standard design choice, however, their high repair cost is often considered an unavoidable price to pay for high storage efficiency and high reliability. BASIC codes can achieve the optimal tradeoff between storage capacity and repair bandwidth with much less complexity of regenerating codes, which is first proposed in [1]. This paper integrate one construction of the minimum storage BASIC (MS-BASIC) codes [2] into a Hadoop HDFS cluster testbed with up to 22 storage nodes. We demonstrate that MS-BASIC codes conform to the theoretical findings and achieve recovery bandwidth saving compared to the conventional recovery approach based on RS codes.
Keywords :
Reed-Solomon codes; distributed processing; file organisation; Hadoop HDFS cluster; RS codes; Reed-Solomon codes; big data explosion; data availability; distributed file systems; erasure codes; minimum storage BASIC codes; recovery approach; repair bandwidth; storage capacity; storage nodes; storage paradigms; system perspective; Availability; Bandwidth; Encoding; File systems; Maintenance engineering; Telecommunication traffic; Hadoop; big data; distributed file systems; minimum storage BASIC codes;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Big Data, 2013 IEEE International Conference on
Conference_Location :
Silicon Valley, CA
Type :
conf
DOI :
10.1109/BigData.2013.6691660
Filename :
6691660
Link To Document :
بازگشت