• DocumentCode
    2269964
  • Title

    Dynamic Workflow Management and Monitoring Using DDS

  • Author

    Pan, Pan ; Dubey, Abhishek ; Piccoli, Luciano

  • Author_Institution
    Inst. for Software Integrated Syst., Vanderbilt Univ., Nashville, TN, USA
  • fYear
    2010
  • fDate
    22-26 March 2010
  • Firstpage
    20
  • Lastpage
    29
  • Abstract
    Large scientific computing data-centers require a distributed dependability subsystem that can provide fault isolation and recovery and is capable of learning and predicting failures to improve the reliability of scientific workflows. This paper extends our previous work on the autonomic scientific workflow management systems by presenting a hierarchical dynamic workflow management system that tracks the state of job execution using timed state machines. Workflow monitoring is achieved using a reliable distributed monitoring framework, which employs publish-subscribe middleware built upon OMG Data Distribution Service standard. Failure recovery is achieved by stopping and restarting the failed portions of workflow directed acyclic graph.
  • Keywords
    computer centres; directed graphs; message passing; middleware; natural sciences computing; workflow management software; OMG data distribution service standard; autonomic scientific workflow management systems; distributed dependability subsystem; distributed monitoring framework; failure recovery; fault isolation; hierarchical dynamic workflow management system; large scientific computing data centers; publish-subscribe middleware; scientific workflow reliability; timed state machines; workflow directed acyclic graph; workflow monitoring; Computer network management; Maintenance; Monitoring; Neural networks; Ontologies; Proposals; Resource management; Scalability; Software systems; Technology management; autonomic computing; workflow management;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Engineering of Autonomic and Autonomous Systems (EASe), 2010 Seventh IEEE International Conference and Workshops on
  • Conference_Location
    Oxford
  • Print_ISBN
    978-1-4244-6535-4
  • Electronic_ISBN
    978-1-4244-6536-1
  • Type

    conf

  • DOI
    10.1109/EASe.2010.12
  • Filename
    5457822