DocumentCode :
2175325
Title :
CAMERA 2.0: A Data-centric Metagenomics Community Infrastructure Driven by Scientific Workflows
Author :
Altintas, Ilkay ; Lin, Abel W. ; Chen, Jing ; Churas, Chris ; Gujral, Madhusudan ; Sun, Shulei ; Li, Weizhong ; Manansala, Ramil ; Sedova, Mayya ; Grethe, Jeffrey S. ; Ellisman, Mark
Author_Institution :
San Diego Supercomput. Center, Univ. of California, San Diego, La Jolla, CA, USA
fYear :
2010
fDate :
5-10 July 2010
Firstpage :
352
Lastpage :
359
Abstract :
Over the last decade, workflows have been established as a mechanism for scientific developers to create simplified views of complex scientific processes. However, there is a need for a comprehensive system architecture to link scientific developers creating workflows with researchers launching workflows in large scale computing environments. We present the architecture for the CAMERA 2.0 Cyber infrastructure platform that provides a scaffold where workflows can be uploaded into the system, and user interface components for launching and viewing results are automatically generated. In CAMERA 2.0, scientific developers and metagenomics researchers seamlessly collaborate to (i) wrap data-analysis software applications and heterogeneous tools as Resource Oriented Architecture (ROA) components integrating them using scientific workflows; (ii) publish and run scientific workflows via dynamically generated uniform portal interfaces; (iii) map heterogeneous workflow products to provenance and CAMERA semantic database through a transformation component, to save output data resulting from workflow runs based on this mapping; (iv) record and visualize the provenance of all workflow run-related data and processes; and (v) conduct queries across multiple workflow executions and link these workflow executions to each other through data and provenance related to these runs. Furthermore, workflows added to CAMERA also have access to a variety of physical resources for computation and data management. Here, we demonstrate the usability of this framework with some of the developed metagenomics workflows.
Keywords :
bioinformatics; data analysis; ecology; genomics; meta data; software architecture; user interfaces; workflow management software; CAMERA 2.0; CAMERA semantic database; comprehensive system architecture; cyberinfrastructure platform; data analysis software application; data centric metagenomics community infrastructure; data management; heterogeneous tool; large scale computing environment; resource oriented architecture component; scientific developer; scientific workflow; uniform portal interfaces; user interface component; workflow run-related data; Cameras; Collaboration; Communities; Computer architecture; Portals; Semantics; User interfaces; collaboration; metagenomics; provenance; scientific workflows;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Services (SERVICES-1), 2010 6th World Congress on
Conference_Location :
Miami, FL
Print_ISBN :
978-1-4244-8199-6
Electronic_ISBN :
978-0-7695-4129-7
Type :
conf
DOI :
10.1109/SERVICES.2010.89
Filename :
5577257
Link To Document :
بازگشت