DocumentCode
3298752
Title
Result Integrity Check for MapReduce Computation on Hybrid Clouds
Author
Yongzhi Wang ; Jinpeng Wei ; Srivatsa, Mudhakar
Author_Institution
Florida Int. Univ., Miami, FL, USA
fYear
2013
fDate
June 28 2013-July 3 2013
Firstpage
847
Lastpage
854
Abstract
Large scale adoption of MapReduce computations on public clouds is hindered by the lack of trust on the participating virtual machines, because misbehaving worker nodes can compromise the integrity of the computation result. In this paper, we propose a novel MapReduce framework, Cross Cloud MapReduce (CCMR), which overlays the MapReduce computation on top of a hybrid cloud: the master that is in control of the entire computation and guarantees result integrity runs on a private and trusted cloud, while normal workers run on a public cloud. In order to achieve high accuracy, CCMR proposes a result integrity check scheme on both the map phase and the reduce phase, which combines random task replication, random task verification, and credit accumulation, and CCMR strives to reduce the overhead by reducing cross-cloud communication. We implement our approach based on Apache Hadoop MapReduce and evaluate our implementation on Amazon EC2. Both theoretical and experimental analysis show that our approach can guarantee high result integrity in a normal cloud environment while incurring non-negligible performance overhead (e.g., when 16.7% workers are malicious, CCMR can guarantee at least 99.52% of accuracy with 33.6% of overhead when replication probability is 0.3 and the credit threshold is 50).
Keywords
cloud computing; trusted computing; virtual machines; Apache Hadoop MapReduce; CCMR; MapReduce computations; credit accumulation; cross cloud MapReduce; hybrid clouds; map phase; novel MapReduce framework; private cloud; public clouds; random task verification; reduce phase; replication probability; result integrity check scheme; task replication; trusted cloud; virtual machines; Accuracy; Buffer storage; Cloud computing; Error analysis; History; Security; Virtual machining; Hybrid Cloud; Integrity Assurance; MapReduce;
fLanguage
English
Publisher
ieee
Conference_Titel
Cloud Computing (CLOUD), 2013 IEEE Sixth International Conference on
Conference_Location
Santa Clara, CA
Print_ISBN
978-0-7695-5028-2
Type
conf
DOI
10.1109/CLOUD.2013.118
Filename
6740233
Link To Document