DocumentCode
2591123
Title
bCloudBLAST: An efficient mapreduce program for bioinformatics applications
Author
Meng, Zhen ; Li, Jianhui ; Zhou, Yunchun ; Liu, Qi ; Liu, Yong ; Cao, Wei
Author_Institution
Sci. Data Center, Comput. Network Inf. Center, Beijing, China
Volume
4
fYear
2011
fDate
15-17 Oct. 2011
Firstpage
2072
Lastpage
2076
Abstract
The BLAST programs are widely used tools for searching protein and DNA databases for sequence similarities. We present an improved MapReduce-parallel implementation by splitting both of input query sequence files and sequence databases for search, called bCloudBLAST, showing very good scaling and speedup behavior on large sequence database. bCloudBLAST is written in Java, executable on UNIX/Linux, Windows and NacOS systems. (Free) MapReduce and Hadoop libraries can be found at http://hadoop.apache.org/. For download: http://www.darwintree.cn/tools.htm.
Keywords
DNA; Java; Linux; bioinformatics; molecular biophysics; molecular configurations; parallel databases; proteins; query processing; DNA databases; Hadoop libraries; Java; Linux; MapReduce program; NacOS systems; UNIX; Windows; bCloudBLAST; bioinformatics; input query sequence files; parallel implementation; protein databases; sequence databases; sequence similarities; Bioinformatics; Computer architecture; Databases; Phylogeny; Protein sequence; Virtual machining; BLAST; Bioinformatics; Cloud computing; MapReduce; bCloudBLAST;
fLanguage
English
Publisher
ieee
Conference_Titel
Biomedical Engineering and Informatics (BMEI), 2011 4th International Conference on
Conference_Location
Shanghai
Print_ISBN
978-1-4244-9351-7
Type
conf
DOI
10.1109/BMEI.2011.6098717
Filename
6098717
Link To Document