• DocumentCode
    3066429
  • Title

    A fast map-reduce algorithm for burst errors in big data cloud storage

  • Author

    Xue Qin ; Kelley, Brian ; Saedy, Mahdy

  • Author_Institution
    Dept. of ECE, Univ. of Texas at San Antonio, San Antonio, TX, USA
  • fYear
    2015
  • fDate
    17-20 May 2015
  • Firstpage
    398
  • Lastpage
    403
  • Abstract
    In distributed storage for Big Data systems, there is a need for exact repair, high bandwidth codes. The challenge for exact repair in big-data storage is to simultaneously enable both very high bandwidth repair using Map-Reduce and simple coding schemes that also combine robust maximally distance separable (MDS) exact repair. MDS repair is for the rare, but exceptional outlier error patterns requiring optimum erasure code reconstruction. We construct the optimum fast bandwidth repair for big-data sources. Our system uses Map-Reduce, exact repair reconstruction. The algorithm combines MDS with a second fast decode algorithm in a cloud environment. We illustrate cloud experiments for optimum fast bandwidth reconstruction for 1-Exabyte Big Data in the cloud and demonstrate cloud results for Poisson error rate arrival models. Unlike prior methods, we jointly solve the problem of fast bandwidth repair for burst-memory error patterns and for code rates up to - in a real time error model framework for Big Data. Furthermore, simulations indicate this method outperforms prior fast bandwidth approaches for burst errors. We also illustrate Map-Reduce algorithm optimized for fast bandwidth repair in Big Data storage in clouds.
  • Keywords
    Big Data; cloud computing; parallel processing; stochastic processes; storage management; 1-Exabyte Big Data; Big Data systems; Big data cloud storage; Big-Data sources; MDS repair; MapReduce algorithm; Poisson error rate arrival models; bandwidth codes; burst-memory error patterns; cloud environment; coding schemes; distributed storage; exact repair reconstruction; optimum erasure code reconstruction; optimum fast bandwidth repair; real time error model framework; robust maximally distance separable exact repair; Bandwidth; Big data; Distributed databases; Encoding; Image reconstruction; Maintenance engineering; Systems engineering and theory; Exabyte Big Data; Map-Reduce; cloud systems; erasure codes; exact repair; fast reconstruction; maximally distance separable;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    System of Systems Engineering Conference (SoSE), 2015 10th
  • Conference_Location
    San Antonio, TX
  • Type

    conf

  • DOI
    10.1109/SYSOSE.2015.7151912
  • Filename
    7151912