DocumentCode :
726523
Title :
Challenges of Large-Scale Biomedical Workflows on the Cloud -- A Case Study on the Need for Reproducibility of Results
Author :
Kanwal, Sehrish ; Lonie, Andrew ; Sinnott, Richard O. ; Anderson, Charlotte
Author_Institution :
Dept. of Comput. & Inf. Syst., Univ. of Melbourne, Melbourne, VIC, Australia
fYear :
2015
fDate :
22-25 June 2015
Firstpage :
220
Lastpage :
225
Abstract :
Computational bioinformatics workflows are extensively used to analyse genomics data. With the unprecedented advancements in genomic sequence technology and opportunities for personalized medicines, it is essential that analysis results are repeatable by others, especially when moving into clinical environment. To cope with the complex computational demands of huge biological datasets, a shift to distributed compute resources is unavoidable. A case study was conducted in which three well established bioinformatics analysis groups across Australia were assigned to analyse exome sequence data from a range of patients with a rare condition: disorder of sex development. Initially these groups used their own in-house data processing pipelines, and subsequently used a common bioinformatics workbench based upon Galaxy and offered through the Australia-wide National eResearch Collaboration Tools and Resources (NeCTAR) Research Cloud. This paper describes the experiences in this work and the variability of results. We put forward principles that should be used to ensure reproducibility of scientific results moving forward.
Keywords :
bioinformatics; cloud computing; data analysis; genomics; medical disorders; cloud computing; computational bioinformatics workflows; exome sequence data analysis; genomic data analysis; genomic sequence technology; large-scale biomedical workflows; personalized medicines; sex development disorder; Bioinformatics; DNA; Genomics; Laboratories; Sequential analysis; Software; NeCTAR Research Cloud; bioinformatics workflows; distributed compute resources; exome; reproducibility;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer-Based Medical Systems (CBMS), 2015 IEEE 28th International Symposium on
Conference_Location :
Sao Carlos
Type :
conf
DOI :
10.1109/CBMS.2015.28
Filename :
7167490
Link To Document :
بازگشت