DocumentCode
3740662
Title
Dominoes: Speculative Repair in Erasure-Coded Hadoop System
Author
Xi Yang;Chen Feng;Zhiwei Xu;Xian-He Sun
Author_Institution
Dept. of Comput. Sci., Illinois Inst. of Technol., Chicago, IL, USA
fYear
2015
Firstpage
366
Lastpage
375
Abstract
Data volume grows dramatically in the era of big data. To save capital cost on storage hardware, datacenters currently prefer using erasure coding rather than simply replication to resist data loss. Erasure coding can provide equivalent three-way fault tolerance to HDFS´s default three replication mechanism but degrades data availability for task scheduling. In an erasure-coded system, data reconstruction time will be paid while tasks access the missing blocks during MapReduce job processing. Tasks´ accessing corrupt data introduces task stragglers and degrades resource utilization. To overcome these challenges, we propose a novel mechanism, Dominoes, that coordinates lightweight data states checking and job scheduling to hide such recovery penalty during job processing and enhances job throughputs. The experimental results confirm Dominoes´ effectiveness and efficiency that improves job throughput by 9% to 9.7% under failure at an overhead of 2.6% for failure-free jobs.
Keywords
"Encoding","Maintenance engineering","Facebook","Metadata","Throughput","Production","Schedules"
Publisher
ieee
Conference_Titel
High Performance Computing (HiPC), 2015 IEEE 22nd International Conference on
Type
conf
DOI
10.1109/HiPC.2015.39
Filename
7397652
Link To Document