Title :
Fault tolerant state management for high-volume low-latency data stream workloads
Author :
Muralidharan, K.B. ; Kumar, G. Sathish ; Bhasi, Marath
Author_Institution :
Dept. of Comput. Sci., Cochin Univ. of Sci. & Technol., Kochi, India
Abstract :
One of the major challenges in performing incremental computations on parallel distributed stream processing systems is in the implementation of a mechanism for passing state values across successive runs. One approach is to enhance the granularity from record-at-a-time processing to processing at micro-batch level. A contrasting approach is to follow the record-at-a-time semantics and ensure scalability by means of distributed state management. Both approaches, however, require observing high degree of fault tolerance. In this paper, we study the problem of process state management against non-terminating data stream workloads for low-latency computing using the micro-batch stream processing approach. We attempt to examine methods that could yield optimum levels of state retentions with high degree of fault tolerance for typical processing workloads and propose a three-pronged approach to harness the demand.
Keywords :
data handling; fault tolerant computing; parallel processing; distributed state management; fault tolerant state management; high-volume low-latency data stream workload; microbatch stream processing approach; parallel distributed stream processing systems; process state management; record-at-a-time processing; record-at-a-time semantics; Distributed databases; Educational institutions; Fault tolerance; Fault tolerant systems; Information filters; Scalability; Data stream processing; fault tolerance; micro-batch processing; state management;
Conference_Titel :
Data Science & Engineering (ICDSE), 2014 International Conference on
Conference_Location :
Kochi
Print_ISBN :
978-1-4799-6870-1
DOI :
10.1109/ICDSE.2014.6974606