DocumentCode :
168757
Title :
A Storage Policy for a Hybrid Federated Cloud platform: A Case Study for Bioinformatics
Author :
Lima, Deric ; Moura, Breno ; Oliveira, Gustavo ; Ribeiro, Eraldo ; Araujo, A. ; Holanda, Maristela ; Togawa, Roberto ; Walter, Maria Emilia
Author_Institution :
Dept. of Comput. Sci., Univ. of Brasilia, Brasilia, Brazil
fYear :
2014
fDate :
26-29 May 2014
Firstpage :
738
Lastpage :
747
Abstract :
Bioinformatics tools require large-scale processing mainly due to very large databases achieving gigabytes of size. In federated cloud environments, although services and resources may be shared, storage is particularly difficult, due to distinct computational capabilities and data management policies of several separated clouds. In this work, we propose a storage policy for BioNimbuZ, a hybrid federated cloud platform designed to execute bioinformatics applications. Our storage policy, BioClouZ, aims to perform efficient choices to distribute and replicate files to the best available cloud resources in the federation in order to reduce computational time. BioClouZ uses four parameters - latency, uptime, free size and cost, weighted (according to ad hoc tests) to model their influences to data storage and recovery. Experiments were performed with real biological data executing a commonly used tool to map short reads in a reference genome in BioNimbuZ, composed of clouds executing in Amazon EC2, Azure and University of Brasilia. The results showed that, when compared to the greedy algorithm first used in BioNimbuZ, the BioClouZ policy significantly improved the total execution time due to more efficient choices of the clouds to store the files. Other bioinformatics applications can be used with BioClouZ in BioNimbuZ as well, since the platform was designed independently from particular tools and databases.
Keywords :
bioinformatics; cloud computing; storage management; very large databases; Amazon EC2; Azure; BioClouZ; BioNimbuZ; University of Brasilia; bioinformatics tools; computational capabilities; computational time reduction; data management policies; data recovery; data storage; hybrid federated cloud platform; reference genome; storage policy; very large databases; Bioinformatics; Biological system modeling; Cloud computing; Computer architecture; Genomics; Prototypes; Servers; bioinformatics; hybrid federated cloud platform; storage policy;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Cluster, Cloud and Grid Computing (CCGrid), 2014 14th IEEE/ACM International Symposium on
Conference_Location :
Chicago, IL
Type :
conf
DOI :
10.1109/CCGrid.2014.102
Filename :
6846526
Link To Document :
بازگشت