Title :
Blast-Parallel: The parallelizing implementation of sequence alignment algorithms based on Hadoop platform
Author :
Ming Meng ; Jing Gao ; Jun-jie Chen
Author_Institution :
Coll. of Comput. & Inf. Eng., Inner Mongolia Agric. Univ., Huhhot, China
Abstract :
The sequence alignment is a basic method for processing the information in Bioinformatics, it has a great significance for finding the function and the structure of nucleic acids and protein sequences and the information of evolution. This paper briefly describes the relevant issues of sequence alignment and the most common local sequence alignment algorithms, Blast algorithm. At present, the Blast algorithm which provided by NCBI or stand-alone can not meet the actual demand for the flood of biological data, this paper achieves the Blast-Parallel algorithm by further improvement based on the Hadoop-Blast algorithm. Through serial experiments of the stand-alone Blast algorithm and parallelizing experiments of the Hadoop-Blast algorithm and the Blast-Parallel algorithm based on Hadoop platform, results show that the Blast algorithm has significantly higher execution efficiency after the parallelization, and the matching speed of the Blast-Parallel algorithm which has been improved can achieve 1~1.5 times of the Hadoop-Blast algorithm.
Keywords :
bioinformatics; evolution (biological); genetics; genomics; molecular biophysics; molecular configurations; parallel algorithms; proteins; Hadoop-blast algorithm; NCBI; bioinformatics; biological data; blast-parallel algorithm; evolution information processing; flood data; local sequence alignment algorithms; nucleic acid structure function; parallelization; protein sequence structure function; Algorithm design and analysis; Bioinformatics; Clustering algorithms; Databases; Genetics; Heuristic algorithms; Parallel processing; Blast; Hadoop; Sequence alignment; parallelization;
Conference_Titel :
Biomedical Engineering and Informatics (BMEI), 2013 6th International Conference on
Conference_Location :
Hangzhou
Print_ISBN :
978-1-4799-2760-9
DOI :
10.1109/BMEI.2013.6746988