DocumentCode
683825
Title
Building localized bioinformatics platform based on Galaxy and high performance computing cluster
Author
Xiao-Lei Wang ; Jiang-yu Li ; Yang Liu ; Yu-feng Wang ; Dong-sheng Zhao
Author_Institution
Inst. of Health Service & Med. Inf., Acad. of Mil. Med. Sci., Beijing, China
fYear
2013
fDate
16-18 Dec. 2013
Firstpage
712
Lastpage
716
Abstract
With the rapid development of high-throughput sequencing technology, biomedical research has entered into the era of big data. It causes problems about storage and analysis of massive biological data which need to be solved by high-performance computing. Therefore, we build the localized high-performance one-stop data analysis platform to provide convenient and efficient computational analysis services for biomedical researchers. We deploy Galaxy and integrate software tools and datasets into Galaxy in computing cluster, build stable web service, FTP service and management database in order to optimize and improve the performance of Galaxy, and use distributed resource management application interface to collaborate Galaxy with Sun Grid Engine for automatically scheduling and assigning computing resources. Currently the platform has been put into trial operation. The peak performance is 10 Teraflops and the capacity of storage is 40TB. The platform provides many functions such as sequence alignment, short sequence mapping, gene annotation, transcriptome analysis, metagenomic analysis and phylogenetic analysis, and approximately 700GB reference databases including human genome, viruses, bacteria, fungi, etc.
Keywords
Big Data; Web services; bioinformatics; data analysis; molecular biophysics; parallel processing; resource allocation; scheduling; FTP service; Galaxy; Sun Grid Engine; Web service; bacteria database; big data; biomedical research; computational analysis services; data storage; distributed resource management application; file transfer protocol; fungi database; gene annotation; high performance computing cluster; high-throughput sequencing technology; human genome database; localized bioinformatics platform; localized high-performance one-stop data analysis platform; management database; metagenomic analysis; phylogenetic analysis; resource scheduling; sequence alignment; short sequence mapping; software tools; transcriptome analysis; virus database; Bioinformatics; Data analysis; Genomics; Microorganisms; Software; Visual databases; Bioinformatics; Galaxy; High-performance Computing; Localized; Online analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Biomedical Engineering and Informatics (BMEI), 2013 6th International Conference on
Conference_Location
Hangzhou
Print_ISBN
978-1-4799-2760-9
Type
conf
DOI
10.1109/BMEI.2013.6747031
Filename
6747031
Link To Document