Title :
Speculative pipelining for compute cloud programming
Author :
Kung, H.T. ; Lin, Chit-Kwan ; Vlah, Dario ; Scorza, Giovanni Berlanda
Author_Institution :
Harvard Univ. Cambridge, Cambridge, MA, USA
fDate :
Oct. 31 2010-Nov. 3 2010
Abstract :
MapReduce job execution typically occurs in sequential phases of parallel steps. These phases can experience unpredictable delays when available computing and network capacities fluctuate or when there are large disparities in inter-node communication delays, as can occur on shared compute clouds. We propose a pipeline-based scheduling strategy, called speculative pipelining, which uses speculative prefetching and computing to minimize execution delays in subsequent stages due to varying resource availability. Our proposed method can mask the time required to perform speculative operations by overlapping with other ongoing operations. We introduce the notion of “open-option” prefetching, which, via coding techniques, allows speculative prefetching to begin even before knowing exactly which input will be needed. On a compute cloud testbed, we apply speculative pipelining to the Hadoop sorting benchmark and show that sorting time is shortened significantly.
Keywords :
cloud computing; storage management; compute cloud programming; pipeline-based scheduling strategy; speculative pipelining; speculative prefetching; Availability; Bandwidth; Clouds; Delay; Pipeline processing; Prefetching; Programming;
Conference_Titel :
MILITARY COMMUNICATIONS CONFERENCE, 2010 - MILCOM 2010
Conference_Location :
San Jose, CA
Print_ISBN :
978-1-4244-8178-1
DOI :
10.1109/MILCOM.2010.5680451