• DocumentCode
    1421510
  • Title
    EvolvingSpace: A Data Centric Framework for Integrating Bioinformatics Applications
  • Author
    Wang, Chen ; Zhou, Bing Bing ; Zomaya, Albert Y.
  • Author_Institution
    ICT Center, CSIRO, Epping, NSW, Australia
  • Volume
    59
  • Issue
    6
  • fYear
    2010
  • fDate
    6/1/2010
  • Firstpage
    721
  • Lastpage
    734
  • Abstract
    The paper presents EvolvingSpace, a data-centric distributed system intended to address the data and application integration problem in bioinformatics data centers. The system employs commodity PCs for data storage and computation. EvolvingSpace manages data in a decentralized manner, which is convenient for storing data annotations and can eliminate potential data-access bottlenecks. It indexes distributed data at multiple levels to facilitate the construction of complex workflows that consist of applications running on different types of data. In addition, the paper proposes a data-locality- and workflow-aware scheduling algorithm (ES-Scheduling) to balance data distribution and computing performance as well as throughput and workflow response time. We ran extensive experiments on the system with real bioinformatics applications. Our results show that the system runs integrated bioinformatics applications efficiently and scales well.
  • Keywords
    bioinformatics; computer centres; data handling; scheduling; workflow management software; EvolvingSpace; bioinformatics application integration; bioinformatics data centers; commodity PCs; complex workflows; data centric distributed system; data locality; data storage; workflow aware scheduling algorithm; Bioinformatics; Data analysis; Data models; Distributed computing; Genomics; Memory; Personal communication networks; Processor scheduling; Relational databases; Scalability; Scheduling algorithm; Distributed systems; bioinformatics; data models; data sharing; scheduling; workflow management
  • fLanguage
    English
  • Journal_Title
    Computers, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    0018-9340
  • Type
    jour
  • DOI
    10.1109/TC.2010.39
  • Filename
    5416682