DocumentCode :
122961
Title :
Cloud-based parallel suffix array construction based on MPI
Author :
Abdelhadi, Ahmed ; Kandil, A.H. ; Abouelhoda, Mohamed
Author_Institution :
Syst. & Biomed. Eng. Dept., Cairo Univ., Cairo, Egypt
fYear :
2014
fDate :
17-20 Feb. 2014
Firstpage :
334
Lastpage :
337
Abstract :
Massive amount of genomics data are being produced nowadays by Next Generation Sequencing machines. The suffix array is currently the best choice for indexing genomics data, because of its efficiency and large number of applications. In this paper, we address the problem of constructing the suffix array on computer cluster in the cloud. We present a solution that automates the establishment of a computer cluster in a cloud and automatically constructs the suffix array in a distributed fashion over the cluster nodes. This has the advantage of encapsulating all set-up details and execution of the algorithm. The distributed nature of the algorithm we use overcomes the problem that arises when the user wishes, due to cost issues, to use low memory machines in the cloud. Our experiments show that our implementation scales well with the increasing number of processors. The cloud cost is affordable and it provides a cost effective solution.
Keywords :
DNA; biology computing; cloud computing; genomics; molecular biophysics; MPI; cloud-based parallel suffix array construction; computer cluster; genomics data; low memory machines; next generation sequencing machines; Arrays; Bioinformatics; Cloud computing; Clustering algorithms; Computers; Program processors; Sorting; Cloud Computing; Distributed Computing; Suffix Array;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Biomedical Engineering (MECBME), 2014 Middle East Conference on
Conference_Location :
Doha
Type :
conf
DOI :
10.1109/MECBME.2014.6783271
Filename :
6783271
Link To Document :
بازگشت