• DocumentCode
    2092495
  • Title

    Improving Throughput and Reliability of Distributed Scientific Workflows for Streaming Data Processing

  • Author

    Gu, Yi ; Wu, Qishi ; Liu, Xin ; Yu, Dantong

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Memphis, Memphis, TN, USA
  • fYear
    2011
  • fDate
    2-4 Sept. 2011
  • Firstpage
    347
  • Lastpage
    354
  • Abstract
    With the advent of next-generation scientific applications, the workflow-based computing technology has become an indispensable research method for managing and streamlining large-scale distributed data processing. This paper investigates a problem of mapping distributed workflows for streaming data processing in faulty networks where nodes and links are subject to probabilistic failures. We formulate this problem as a bi-objective optimization problem in terms of both throughput and reliability, and propose a decentralized layer-oriented method to achieve high throughput for smooth data flow while satisfying a prespecified overall failure rate bound for a guaranteed level of reliability. The superiority of the proposed mapping solution is illustrated by both extensive simulation-based performance comparisons with existing algorithms and experimental results from a real-life scientific workflow deployed in wide-area networks.
  • Keywords
    data flow computing; data handling; fault tolerant computing; natural sciences computing; optimisation; software reliability; wide area networks; workflow management software; biobjective optimization problem; data flow; decentralized layer-oriented method; distributed scientific workflow; failure rate bound; faulty network; large-scale distributed data processing; next-generation scientific application; probabilistic failure; streaming data processing; wide-area network; workflow-based computing technology; Approximation methods; Complexity theory; Computational modeling; Computer network reliability; Optimization; Reliability; Throughput; Reliability; distributed computing; fault tolerance; frame rate; workflow mapping;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing and Communications (HPCC), 2011 IEEE 13th International Conference on
  • Conference_Location
    Banff, AB
  • Print_ISBN
    978-1-4577-1564-8
  • Electronic_ISBN
    978-0-7695-4538-7
  • Type

    conf

  • DOI
    10.1109/HPCC.2011.52
  • Filename
    6063011