• DocumentCode
    3740662
  • Title

    Dominoes: Speculative Repair in Erasure-Coded Hadoop System

  • Author

    Xi Yang;Chen Feng;Zhiwei Xu;Xian-He Sun

  • Author_Institution
    Dept. of Comput. Sci., Illinois Inst. of Technol., Chicago, IL, USA
  • fYear
    2015
  • Firstpage
    366
  • Lastpage
    375
  • Abstract
    Data volume grows dramatically in the era of big data. To save capital cost on storage hardware, datacenters currently prefer using erasure coding rather than simply replication to resist data loss. Erasure coding can provide equivalent three-way fault tolerance to HDFS´s default three replication mechanism but degrades data availability for task scheduling. In an erasure-coded system, data reconstruction time will be paid while tasks access the missing blocks during MapReduce job processing. Tasks´ accessing corrupt data introduces task stragglers and degrades resource utilization. To overcome these challenges, we propose a novel mechanism, Dominoes, that coordinates lightweight data states checking and job scheduling to hide such recovery penalty during job processing and enhances job throughputs. The experimental results confirm Dominoes´ effectiveness and efficiency that improves job throughput by 9% to 9.7% under failure at an overhead of 2.6% for failure-free jobs.
  • Keywords
    "Encoding","Maintenance engineering","Facebook","Metadata","Throughput","Production","Schedules"
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing (HiPC), 2015 IEEE 22nd International Conference on
  • Type

    conf

  • DOI
    10.1109/HiPC.2015.39
  • Filename
    7397652