• DocumentCode
    3298752
  • Title

    Result Integrity Check for MapReduce Computation on Hybrid Clouds

  • Author

    Yongzhi Wang ; Jinpeng Wei ; Srivatsa, Mudhakar

  • Author_Institution
    Florida Int. Univ., Miami, FL, USA
  • fYear
    2013
  • fDate
    June 28 2013-July 3 2013
  • Firstpage
    847
  • Lastpage
    854
  • Abstract
    Large scale adoption of MapReduce computations on public clouds is hindered by the lack of trust on the participating virtual machines, because misbehaving worker nodes can compromise the integrity of the computation result. In this paper, we propose a novel MapReduce framework, Cross Cloud MapReduce (CCMR), which overlays the MapReduce computation on top of a hybrid cloud: the master that is in control of the entire computation and guarantees result integrity runs on a private and trusted cloud, while normal workers run on a public cloud. In order to achieve high accuracy, CCMR proposes a result integrity check scheme on both the map phase and the reduce phase, which combines random task replication, random task verification, and credit accumulation, and CCMR strives to reduce the overhead by reducing cross-cloud communication. We implement our approach based on Apache Hadoop MapReduce and evaluate our implementation on Amazon EC2. Both theoretical and experimental analysis show that our approach can guarantee high result integrity in a normal cloud environment while incurring non-negligible performance overhead (e.g., when 16.7% workers are malicious, CCMR can guarantee at least 99.52% of accuracy with 33.6% of overhead when replication probability is 0.3 and the credit threshold is 50).
  • Keywords
    cloud computing; trusted computing; virtual machines; Apache Hadoop MapReduce; CCMR; MapReduce computations; credit accumulation; cross cloud MapReduce; hybrid clouds; map phase; novel MapReduce framework; private cloud; public clouds; random task verification; reduce phase; replication probability; result integrity check scheme; task replication; trusted cloud; virtual machines; Accuracy; Buffer storage; Cloud computing; Error analysis; History; Security; Virtual machining; Hybrid Cloud; Integrity Assurance; MapReduce;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cloud Computing (CLOUD), 2013 IEEE Sixth International Conference on
  • Conference_Location
    Santa Clara, CA
  • Print_ISBN
    978-0-7695-5028-2
  • Type

    conf

  • DOI
    10.1109/CLOUD.2013.118
  • Filename
    6740233