DocumentCode :
168607
Title :
Bridging Data in the Clouds: An Environment-Aware System for Geographically Distributed Data Transfers
Author :
Tudoran, Radu ; Costan, Alexandru ; Rui Wang ; Bouge, Luc ; Antoniu, Gabriel
Author_Institution :
INRIA Rennes - Bretagne Atlantique, Rennes, France
fYear :
2014
fDate :
26-29 May 2014
Firstpage :
92
Lastpage :
101
Abstract :
Today's continuously growing cloud infrastructures provide support for processing ever increasing amounts of scientific data. Cloud resources for computation and storage are spread among globally distributed datacenters. Thus, to leverage the full computation power of the clouds, global data processing across multiple sites has to be fully enabled. However, managing data across geographically distributed datacenters is not trivial as it involves high and variable latencies among sites which come at a high monetary cost. In this work, we propose a uniform data management system for scientific applications running across geographically distributed sites. Our solution is environment-aware, as it monitors and models the global cloud infrastructure, and offers predictable data handling performance for transfer cost and time. In terms of efficiency, it provides the applications with the possibility to set a trade-off between money and time and optimizes the transfer strategy accordingly. The system was validated on Microsoft's Azure cloud across the 6 EU and US datacenters. The experiments were conducted on hundreds of nodes using both synthetic benchmarks and the real-life A-Brain application. The results show that our system is able to model and predict well the cloud performance and to leverage this into efficient data dissemination. Our approach reduces the monetary costs and transfer time by up to 3 times.
Keywords :
cloud computing; computer centres; data handling; database management systems; resource allocation; EU data centers; Microsoft Azure cloud; US data centers; cloud resources computation; cloud resources storage; data handling performance; data management system; environment-aware system; geographically distributed data centers; geographically distributed data transfers; global cloud infrastructure; global data processing; globally distributed data centers; scientific applications; scientific data processing; transfer strategy; Cloud computing; Data models; Data transfer; Distributed databases; Mathematical model; Monitoring; Throughput; Azure; Big Data; cloud computing; data management; geographically distributed datacenters
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Cluster, Cloud and Grid Computing (CCGrid), 2014 14th IEEE/ACM International Symposium on
Conference_Location :
Chicago, IL
Type :
conf
DOI :
10.1109/CCGrid.2014.86
Filename :
6846444