DocumentCode
2321132
Title
TomusBlobs: Towards Communication-Efficient Storage for MapReduce Applications in Azure
Author
Tudoran, Radu ; Costan, Alexandru ; Antoniu, Gabriel ; Soncu, Hakan
Author_Institution
INRIA Rennes Bretagne Atlantique, Rennes, France
fYear
2012
fDate
13-16 May 2012
Firstpage
427
Lastpage
434
Abstract
The emergence of cloud computing brought the opportunity to use large-scale compute infrastructures for a broad spectrum of applications and users. While the cloud paradigm is attractive for its "elasticity" in resource usage and associated costs (users pay only for the resources actually used), cloud applications still suffer from the high latencies and low performance of cloud storage services. Enabling high-throughput massive data processing on cloud data becomes a critical issue, as it impacts overall application performance. In this paper we address this challenge at the level of cloud storage. We introduce a concurrency-optimized data storage system that federates the virtual disks associated with VMs. We demonstrate the performance of our solution for efficient data-intensive processing on commercial clouds by building an optimized prototype MapReduce framework for Azure that leverages the benefits of our storage solution. We perform extensive synthetic benchmarks as well as experiments with real-world applications; they demonstrate that our solution brings substantial benefits to data-intensive applications compared to approaches relying on state-of-the-art cloud object storage.
Keywords
cloud computing; concurrency control; resource allocation; storage management; virtual machines; Azure; MapReduce applications; TomusBlobs; VM; cloud applications; cloud computing; cloud data; cloud object storage; cloud paradigm; cloud storage service performance; communication-efficient storage; concurrency-optimized data storage system; data-intensive processing; high-throughput massive data processing; resource usage; virtual disks; Cloud computing; Computer architecture; Distributed databases; Prototypes; Scalability; Throughput;
fLanguage
English
Publisher
IEEE
Conference_Titel
Cluster, Cloud and Grid Computing (CCGrid), 2012 12th IEEE/ACM International Symposium on
Conference_Location
Ottawa, ON
Print_ISBN
978-1-4673-1395-7
Type
conf
DOI
10.1109/CCGrid.2012.104
Filename
6217450