DocumentCode
1525977
Title
CMS Workflow Execution Using Intelligent Job Scheduling and Data Access Strategies
Author
Hasham, Khawar ; Peris, Antonio Delgado ; Anjum, Ashiq ; Evans, Dave ; Gowdy, Stephen ; Hernández, José M. ; Huedo, Eduardo ; Hufnagel, Dirk ; Van Lingen, Frank ; McClatchey, Richard ; Metson, Simon
Author_Institution
Comput. Sci., UWE Bristol, Bristol, UK
Volume
58
Issue
3
fYear
2011
fDate
6/1/2011
Firstpage
1221
Lastpage
1232
Abstract
Complex scientific workflows can process large amounts of data using thousands of tasks. The turnaround times of these workflows are often affected by various latencies, such as the resource discovery, scheduling, and data access latencies of the individual workflow processes or actors. Minimizing these latencies will improve the overall execution time of a workflow and thus lead to a more efficient and robust processing environment. In this paper, we propose a pilot job concept that uses intelligent data reuse and job execution strategies to minimize the scheduling, queuing, execution, and data access latencies. The results show that significant improvements in the overall turnaround time of a workflow can be achieved with this approach. The proposed approach has been evaluated first using the CMS Tier0 data processing workflow and then by simulating the workflows to evaluate its effectiveness in a controlled environment.
Keywords
administrative data processing; information retrieval; physics computing; position sensitive particle detectors; scheduling; CMS Tier0 data processing workflow; CMS workflow execution; data access strategies; job execution strategies; job scheduling; pilot job concept; Data processing; Delay; Detectors; Large Hadron Collider; Monitoring; Peer to peer computing; Physics; Data cache; grid; latency; pilot jobs; workflows;
fLanguage
English
Journal_Title
Nuclear Science, IEEE Transactions on
Publisher
IEEE
ISSN
0018-9499
Type
jour
DOI
10.1109/TNS.2011.2146276
Filename
5773464