DocumentCode
2908299
Title
Grid Approach to Embarrassingly Parallel CPU-Intensive Bioinformatics Problems
Author
Stockinger, Heinz ; Pagni, Marco ; Cerutti, Lorenzo ; Falquet, Laurent
Author_Institution
Swiss Institute of Bioinformatics, Vital-IT, Switzerland
fYear
2006
fDate
Dec. 2006
Firstpage
58
Lastpage
58
Abstract
Bioinformatics algorithms such as sequence alignment methods based on profile-HMM (Hidden Markov Model) are popular but CPU-intensive. If large amounts of data are processed, a single computer often runs for many hours or even days. High performance infrastructures such as clusters or computational Grids provide the techniques to speed up the process by distributing the workload to remote nodes, running parts of the work load in parallel. Biologists often do not have access to such hardware systems. Therefore, we propose a new system using a modern Grid approach to optimise an embarrassingly parallel problem. We achieve speed ups by at least two orders of magnitude given that we can use a powerful, world-wide distributed Grid infrastructure. For large-scale problems our method can outperform algorithms designed for mid-size clusters even considering additional latencies imposed by Grid infrastructures.
Keywords
Bioinformatics; Biology computing; Clustering algorithms; Concurrent computing; Distributed computing; Grid computing; Hardware; Hidden Markov models; High performance computing; Uninterruptible power systems;
fLanguage
English
Publisher
ieee
Conference_Titel
e-Science and Grid Computing, 2006. e-Science '06. Second IEEE International Conference on
Conference_Location
Amsterdam, The Netherlands
Print_ISBN
0-7695-2734-5
Type
conf
DOI
10.1109/E-SCIENCE.2006.261142
Filename
4031031
Link To Document