Title :
Cloud-GSQCT:a parallel approach to screen gene sequences for phylogenetics analysis
Author :
Meng, Zhen ; Xiao, Xiao ; Li, Jianhui ; Zhou, Yuanchun ; Cao, Wei ; Shen, Geng
Author_Institution :
Sci. Data Center, Comput. Network Inf. Center, Beijing, China
Abstract :
Screening data for phylogenetic analysis from large datasets is a known computational problem of data-intensive application. In this paper, we implement a parallel approach, Cloud-GSQCT (Cloud Gene Sequence Quality Control Tool), to screen gene sequence data for phylogenetic analysis, using the MapReduce paradigm to parallelize the solution and to manage its execution. The parallel approach using Hadoop are implemented and the evaluation of the approach is also presented. For download: http://www.darwintree.cn/tools.htm.
Keywords :
biology computing; cloud computing; genetics; parallel processing; quality control; Cloud-GSQCT; Hadoop; MapReduce paradigm; cloud gene sequence quality control tool; computational problem; data-intensive application; execution management; gene sequence data screening; gene sequences screening; parallel approach; phylogenetics analysis; Biology; Databases; Hardware; High definition video; Data screening; GSQCT; Hadoop; MapReduce;
Conference_Titel :
Computer Science and Information Processing (CSIP), 2012 International Conference on
Conference_Location :
Xi´an, Shaanxi
Print_ISBN :
978-1-4673-1410-7
DOI :
10.1109/CSIP.2012.6308940