DocumentCode :
572908
Title :
Cloud-GSQCT:a parallel approach to screen gene sequences for phylogenetics analysis
Author :
Meng, Zhen ; Xiao, Xiao ; Li, Jianhui ; Zhou, Yuanchun ; Cao, Wei ; Shen, Geng
Author_Institution :
Sci. Data Center, Comput. Network Inf. Center, Beijing, China
fYear :
2012
fDate :
24-26 Aug. 2012
Firstpage :
660
Lastpage :
663
Abstract :
Screening data for phylogenetic analysis from large datasets is a known computational problem of data-intensive application. In this paper, we implement a parallel approach, Cloud-GSQCT (Cloud Gene Sequence Quality Control Tool), to screen gene sequence data for phylogenetic analysis, using the MapReduce paradigm to parallelize the solution and to manage its execution. The parallel approach using Hadoop are implemented and the evaluation of the approach is also presented. For download: http://www.darwintree.cn/tools.htm.
Keywords :
biology computing; cloud computing; genetics; parallel processing; quality control; Cloud-GSQCT; Hadoop; MapReduce paradigm; cloud gene sequence quality control tool; computational problem; data-intensive application; execution management; gene sequence data screening; gene sequences screening; parallel approach; phylogenetics analysis; Biology; Databases; Hardware; High definition video; Data screening; GSQCT; Hadoop; MapReduce;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Science and Information Processing (CSIP), 2012 International Conference on
Conference_Location :
Xi´an, Shaanxi
Print_ISBN :
978-1-4673-1410-7
Type :
conf
DOI :
10.1109/CSIP.2012.6308940
Filename :
6308940
Link To Document :
بازگشت