DocumentCode
505970
Title
Advanced data flow support for scientific grid workflow applications
Author
Qin, Jun ; Fahringer, Thomas
Author_Institution
University of Innsbruck, Innsbruck, Austria
fYear
2007
fDate
10-16 Nov. 2007
Firstpage
1
Lastpage
12
Abstract
Existing work does not provide a flexible dataset-oriented data flow mechanism to meet the complex requirements of scientific Grid workflow applications. In this paper we address this problem by introducing a data collection concept and the corresponding collection distribution constructs, which are inspired by HPF but applied to Grid workflow applications. Based on these constructs, more fine-grained data flows can be specified at an abstract workflow language level, such as mapping a portion of a dataset to an activity, or independently distributing multiple datasets, not necessarily with the same number of data elements, onto loop iterations. Our approach reduces data duplication, optimizes data transfers, and simplifies porting workflow applications onto the Grid. We have extended AGWL with these concepts and implemented the corresponding runtime support in ASKALON. We apply our approach to several real-world scientific workflow applications and report performance results.
Keywords
Application software; Computer science; Control systems; Data engineering; Engineering management; Grid computing; Permission; Resource management; Runtime; Technology management; data collection; data distribution; data flow; grid workflow;
fLanguage
English
Publisher
IEEE
Conference_Titel
Supercomputing, 2007. SC '07. Proceedings of the 2007 ACM/IEEE Conference on
Conference_Location
Reno, NV, USA
Print_ISBN
978-1-59593-764-3
Electronic_ISBN
978-1-59593-764-3
Type
conf
DOI
10.1145/1362622.1362679
Filename
5348801