• DocumentCode
    2790179
  • Title

    An Automated Data Processing Pipeline for Virus Structure Determination at High Resolution

  • Author

    Yu, Chen ; Marinescu, Dan C. ; Morrison, John P. ; Clayton, Brian C. ; Power, David A.

  • Author_Institution
    Sch. of EECS, Central Florida Univ., Orlando, FL
  • fYear
    2007
  • fDate
    26-30 March 2007
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    The automation of the data processing pipeline for virus structure determination at high resolution is a very challenging problem. The interaction between the data collection process and the theoretical modeling and computer simulation is very complex; routine tasks are mixed with decision making processes and unforeseen conditions. This paper dissects some of the most difficult problems posed by the dynamic coordination of complex computational tasks in a large scale distributed data acquisition and analysis system. A flexible coordination model should be capable of accommodating user actions, handling system related activities such as resource discovery and resource allocation, permitting dynamic process description modification, allowing different level of abstraction, providing some level of fault tolerance and backtracking capability. The condensed graphs model of computing developed at University College Cork (UCC) which combines availability-, demand-, and control-driven computation seems to be the most promising for certain classes of problems and complements our previous efforts in developing an intelligent environment for large scale-distributed data acquisition and analysis workflow applications.
  • Keywords
    biology computing; data acquisition; data analysis; graph theory; pipeline processing; automated data processing pipeline; backtracking; computer simulation; condensed graph model; data analysis system; data collection process; decision making process; distributed data acquisition; dynamic process description modification; fault tolerance; high resolution; intelligent environment; resource allocation; resource discovery; virus structure determination; Automation; Computer simulation; Data acquisition; Data analysis; Data processing; Decision making; Distributed computing; Large-scale systems; Pipelines; Resource management;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Processing Symposium, 2007. IPDPS 2007. IEEE International
  • Conference_Location
    Long Beach, CA
  • Print_ISBN
    1-4244-0910-1
  • Electronic_ISBN
    1-4244-0910-1
  • Type

    conf

  • DOI
    10.1109/IPDPS.2007.370452
  • Filename
    4228180