Title :
A high-throughput adaptive computing infrastructure for bioinformatics research
Author :
Pineo, Stuart ; Wang, Zhengyu
Author_Institution :
Celera Genomics, Rockville, MD, USA
Abstract :
To support our company´s bioinformatics research and product development we developed the Celera computing analysis pipeline infrastructure (CCAPI). The N-tiered architecture, which includes a Web based user interface, an Oracle database, and a network of hundreds of CPUs, allows us to perform complex and compute-intensive data processing and analysis on genomic content in a cost-effective way. Developed using Perl, Java/JSP, Oracle´s AQ and Platform LSF, CCAPI creates an adaptive and scalable computing framework that supports task decomposition and parallelization, optimizes computing resource allocation, facilitates the integration of proprietary and third-party applications, and supports automatic error detection and recovery.
Keywords :
Internet; Java; biology computing; online front-ends; parallel processing; pipeline processing; relational databases; scientific information systems; user interfaces; Celera computing analysis pipeline infrastructure; Java/JSP; N-tiered architecture; Oracle database; Perl; Web based user interface; adaptive computing; bioinformatics; computer network; computing resource allocation optimization; data analysis; data processing; error detection; error recovery; genomic content; product development; scalable computing; task decomposition; task parallelization; Bioinformatics; Computer architecture; Computer interfaces; Computer networks; Concurrent computing; Databases; Pipelines; Product development; Service oriented architecture; User interfaces;
Conference_Titel :
Systems Engineering, 2005. ICSEng 2005. 18th International Conference on
Print_ISBN :
0-7695-2359-5
DOI :
10.1109/ICSENG.2005.7