Title :
New Execution Paradigm for Data-Intensive Scientific Workflows
Author :
El-Gayyar, Mahmoud ; Leng, Yan ; Shumilov, Serge ; Cremers, Armin
Author_Institution :
Dept. of Comput. Sci. III, Univ. of Bonn, Bonn, Germany
Abstract :
With the advent of grid and service-oriented technologies, scientific workflows have been introduced in response to the increasing demand of researchers for assembling diverse, highly-specialized applications, allowing them to exchange large heterogeneous datasets in order to accomplish a complex scientific task. Much research has already been done to provide efficient scientific workflow management systems (WfMS). However, most of such WfMS are coordinating and executing workflows in a centralized fashion. This creates a single point of failure, forms a scalability bottleneck, and often leads to excessive traffic routed back to the coordinator. Additionally, none of the available WfMS provides means for dynamic data transformation between services in order to overcome the data heterogeneity problem. This work presents a new approach for scientific workflow management targeted to provide ways for an efficient distributed execution of data-intensive workflows. The proposed approach reduces the communication traffic between services and overcomes the data heterogeneity problem. Moreover, it allows full control over long-running applications, as well as provides support for smart re-run, distributed fault handling and distributed load balancing.
Keywords :
Web services; grid computing; natural sciences computing; resource allocation; workflow management software; Web service; communication traffic reduction; data heterogeneity problem; data-intensive scientific workflows; data-intensive workflows; distributed execution; distributed fault handling; distributed load balancing; dynamic data transformation; grid technology; scientific workflow management systems; service-oriented technology; Application software; Assembly; Communication system control; Communication system traffic control; Computer science; Load management; Ontologies; Scalability; Water resources; Workflow management software; WSDM; Workflows; data transfer; distributed execution; integration; services;
Conference_Titel :
Services - I, 2009 World Conference on
Conference_Location :
Los Angeles, CA
Print_ISBN :
978-0-7695-3708-5
Electronic_ISBN :
978-0-7695-3708-5
DOI :
10.1109/SERVICES-I.2009.22