Title :
Improving Throughput and Reliability of Distributed Scientific Workflows for Streaming Data Processing
Author :
Gu, Yi ; Wu, Qishi ; Liu, Xin ; Yu, Dantong
Author_Institution :
Dept. of Comput. Sci., Univ. of Memphis, Memphis, TN, USA
Abstract :
With the advent of next-generation scientific applications, the workflow-based computing technology has become an indispensable research method for managing and streamlining large-scale distributed data processing. This paper investigates a problem of mapping distributed workflows for streaming data processing in faulty networks where nodes and links are subject to probabilistic failures. We formulate this problem as a bi-objective optimization problem in terms of both throughput and reliability, and propose a decentralized layer-oriented method to achieve high throughput for smooth data flow while satisfying a prespecified overall failure rate bound for a guaranteed level of reliability. The superiority of the proposed mapping solution is illustrated by both extensive simulation-based performance comparisons with existing algorithms and experimental results from a real-life scientific workflow deployed in wide-area networks.
Keywords :
data flow computing; data handling; fault tolerant computing; natural sciences computing; optimisation; software reliability; wide area networks; workflow management software; biobjective optimization problem; data flow; decentralized layer-oriented method; distributed scientific workflow; failure rate bound; faulty network; large-scale distributed data processing; next-generation scientific application; probabilistic failure; streaming data processing; wide-area network; workflow-based computing technology; Approximation methods; Complexity theory; Computational modeling; Computer network reliability; Optimization; Reliability; Throughput; Reliability; distributed computing; fault tolerance; frame rate; workflow mapping;
Conference_Titel :
High Performance Computing and Communications (HPCC), 2011 IEEE 13th International Conference on
Conference_Location :
Banff, AB
Print_ISBN :
978-1-4577-1564-8
Electronic_ISBN :
978-0-7695-4538-7
DOI :
10.1109/HPCC.2011.52