DocumentCode :
3400117
Title :
Benchmarking the performance of Hadoop triple replication and erasure coding on a nation-wide distributed cloud
Author :
Mohan, Lakshmi J. ; Harold, Renji Luke ; Caneleo, Pablo Ignacio Serrano ; Parampalli, Udaya ; Harwood, Aaron
Author_Institution :
Dept. of Comput. & Inf. Syst., Univ. of Melbourne, Melbourne, VIC, Australia
fYear :
2015
fDate :
22-24 June 2015
Firstpage :
61
Lastpage :
65
Abstract :
Large-scale distributed storage systems play a vital role in maintaining data across storage locations globally. These systems use replication as the default mechanism for providing fault tolerance. Recently, erasure codes have been used as a viable alternative to replication, since they provide the same fault tolerance with reduced storage overhead. However, their performance is unclear in a geographically diverse distributed storage system. This paper compares the performance of triple replication with the erasure coding (Reed-Solomon codes) used in Apache Hadoop's implementation of a distributed file system, on a cluster distributed across Australia that runs on the NeCTAR research cloud. Our results show that using erasure coding does not degrade the read performance in such a setting. We also compare Hadoop's code with a local reconstruction code, implemented in the XORBAS version of Hadoop. These codes perform well in our clusters, but the performance gain observed in our results does not conform to the results reported. Hence, we need new codes that perform better, addressing the geographical diversity issue. We believe that our framework is readily usable to test a range of novel erasure codes that are being introduced in the literature.
Keywords :
Reed-Solomon codes; cloud computing; data handling; distributed processing; Apache Hadoop; Hadoop triple replication; NeCTAR research cloud; data across storage locations; distributed file system; erasure coding; geographically diverse distributed storage system; large scale distributed storage systems; nationwide distributed cloud; Australia; Decoding; Distributed databases; Encoding; Facebook; Network coding
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Network Coding (NetCod), 2015 International Symposium on
Conference_Location :
Sydney, NSW
Type :
conf
DOI :
10.1109/NETCOD.2015.7176790
Filename :
7176790