Title :
Integrating Policy with Scientific Workflow Management for Data-Intensive Applications
Author :
Chervenak, Ann L. ; Smith, D.E. ; Weiwei Chen ; Deelman, Ewa
Author_Institution :
Inf. Sci. Inst., Univ. of Southern California, Marina del Rey, CA, USA
Abstract :
As scientific applications generate and consume data at ever-increasing rates, scientific workflow systems that manage the growing complexity of analyses and data movement will increase in importance. The goal of our work is to improve the overall performance of scientific workflows by using policy to improve data staging into and out of computational resources. We developed a Policy Service that gives advice to the workflow system about how to stage data, including advice on the order of data transfers and on transfer parameters. The Policy Service gives this advice based on its knowledge of ongoing transfers, recent transfer performance, and the current allocation of resources for data staging. The paper describes the architecture of the Policy Service and its integration with the Pegasus Workflow Management System. It employs a range of policies for data staging, and presents performance results for one policy that does a greedy allocation of data transfer streams between source and destination sites. The results show performance improvements for a data-intensive workflow: the Montage astronomy workflow augmented to perform additional large data staging operations.
Keywords :
astronomy computing; data handling; electronic data interchange; resource allocation; software architecture; software performance evaluation; workflow management software; Montage astronomy workflow; Pegasus workflow management system; computational resources; data movement; data transfers; data-intensive applications; greedy allocation; large data staging operations; performance improvement; policy integration; policy service architecture; resource allocation; scientific applications; scientific workflow management; scientific workflow systems; transfer parameters; transfer performance; data placement; greedy allocation policy; policy service; scientific workflow;
Conference_Titel :
High Performance Computing, Networking, Storage and Analysis (SCC), 2012 SC Companion:
Conference_Location :
Salt Lake City, UT
Print_ISBN :
978-1-4673-6218-4
DOI :
10.1109/SC.Companion.2012.29