DocumentCode :
580075
Title :
VGrADS: enabling e-Science workflows on grids and clouds with fault tolerance
Author :
Ramakrishnan, Lavanya ; Koelbel, C. ; Yang-suk Kee ; Wolski, Richard ; Nurmi, D. ; Gannon, Dennis ; Obertelli, G. ; YarKhan, Asim ; Mandal, Avirup ; Huang, T.M. ; Thyagaraja, K. ; Zagorodnov, D.
Author_Institution :
Indiana Univ., Bloomington, IN, USA
fYear :
2009
fDate :
14-20 Nov. 2009
Firstpage :
1
Lastpage :
12
Abstract :
Today´s scientific workflows use distributed heterogeneous resources through diverse grid and cloud interfaces that are often hard to program. In addition, especially for time-sensitive critical applications, predictable quality of service is necessary across these distributed resources. VGrADS´ virtual grid execution system (vgES) provides an uniform qualitative resource abstraction over grid and cloud systems. We apply vgES for scheduling a set of deadline sensitive weather forecasting workflows. Specifically, this paper reports on our experiences with (1) virtualized reservations for batchqueue systems, (2) coordinated usage of TeraGrid (batch queue), Amazon EC2 (cloud), our own clusters (batch queue) and Eucalyptus (cloud) resources, and (3) fault tolerance through automated task replication. The combined effect of these techniques was to enable a new workflow planning method to balance performance, reliability and cost considerations. The results point toward improved resource selection and execution management support for a variety of e-Science applications over grids and cloud systems.
Keywords :
cloud computing; geophysics computing; grid computing; quality of service; resource allocation; software fault tolerance; weather forecasting; Amazon EC2; VGrADS; automated task replication; batch queue; batchqueue system; cloud interface; cloud resource; cloud system; distributed heterogeneous resource; distributed resource; diverse grid; e-science workflow; eucalyptus resource; execution management support; fault tolerance; grid system; qualitative resource abstraction; quality of service; resource selection; scientific workflow; teragrid; vgES; virtual grid execution system; virtualized reservation; weather forecasting workflow scheduling; workflow planning method;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
High Performance Computing Networking, Storage and Analysis, Proceedings of the Conference on
Conference_Location :
Portland, OR
Type :
conf
DOI :
10.1145/1654059.1654107
Filename :
6375523
Link To Document :
بازگشت