DocumentCode
3352403
Title
Trellis driver: distributing a java workflow across a network of workstations
Author
Lamb, N. ; Lu, P. ; Fyshe, A.
Author_Institution
University of Alberta
fYear
2004
fDate
18-18 Aug. 2004
Firstpage
198
Lastpage
205
Abstract
Some applications in science and engineering consist of a main job that invokes, or drives, other jobs. For example, a server process may receive a request, then invoke a workflow of stand-alone scripts or executables to handle the request, and then generate the final response. Java??s Runtime.exec() function allows jobs to be invoked from within a master Java program. However, these jobs are usually restricted to the same machine. If the number of jobs in the workflow is large, then it can be desirable to load balance the workload across different servers to maximize throughput. We describe the design and implementation of the Trellis Driver, a newly-developed Java module that runs jobs using TrellisDriver.exec() and allows jobs to be scheduled across clusters and metacomputers (i.e., aggregations of servers). Using a Java-based bioinformatics application as a case study, we evaluate the performance improvement Trellis Driver offers through workflow parallelism.
Keywords
Bioinformatics; Concurrent computing; Java; Parallel processing; Pipelines; Proteins; Shape control; Throughput; Workstations; Yarn;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel Processing Workshops, 2004. ICPP 2004 Workshops. Proceedings. 2004 International Conference on
Conference_Location
Montreal, QC, Canada
ISSN
1530-2016
Print_ISBN
0-7695-2198-3
Type
conf
DOI
10.1109/ICPPW.2004.1328018
Filename
1328018
Link To Document