DocumentCode
1564774
Title
Gene sequence alignment on a public computing platform
Author
Pellicer, Stephen ; Ahmed, Nova ; Pan, Yi ; Zheng, Yao
Author_Institution
Dept. of Comput. Sci., Georgia State Univ., Atlanta, GA, USA
fYear
2005
Firstpage
95
Lastpage
102
Abstract
Public computing can potentially supply not only computational power but also memory and short term storage resources to grid and cluster scale problems. Gene sequence alignment is a fundamental computational challenge in bioinformatics with attributes such as moderate computational requirements, extensive memory requirements, and highly interdependent tasks. This study examines the performance of calculating the alignment for two 100,000 base sequences on a public computing platform utilizing the BOINC framework. When compared to the theoretical, optimal sequential implementation, the parallel implementation achieves speedup by a factor of 1.4 and at the point of maximum parallelism and ends with a speedup of 1.2. This speedup factor is based on extrapolation of the sequential performance of a segment of the problem. This extrapolation would require a theoretical sequential machine with approximately 37.3 GB of working memory or suffer performance degradation from use of secondary storage during the calculation.
Keywords
Internet; biology computing; genetics; BOINC framework; bioinformatics; cluster scale problem; gene sequence alignment; grid scale problem; optimal sequential implementation; performance degradation; public computing; secondary storage; short term storage resources; theoretical sequential machine; Bioinformatics; Biology computing; Computer architecture; Computer science; Extrapolation; Grid computing; Internet; Parallel processing; Processor scheduling; Sequences;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel Processing, 2005. ICPP 2005 Workshops. International Conference Workshops on
ISSN
1530-2016
Print_ISBN
0-7695-2381-1
Type
conf
DOI
10.1109/ICPPW.2005.35
Filename
1488682
Link To Document