DocumentCode
1942154
Title
Speculative pipelining for compute cloud programming
Author
Kung, H.T. ; Lin, Chit-Kwan ; Vlah, Dario ; Scorza, Giovanni Berlanda
Author_Institution
Harvard Univ. Cambridge, Cambridge, MA, USA
fYear
2010
fDate
Oct. 31 2010-Nov. 3 2010
Firstpage
2026
Lastpage
2034
Abstract
MapReduce job execution typically occurs in sequential phases of parallel steps. These phases can experience unpredictable delays when available computing and network capacities fluctuate or when there are large disparities in inter-node communication delays, as can occur on shared compute clouds. We propose a pipeline-based scheduling strategy, called speculative pipelining, which uses speculative prefetching and computing to minimize execution delays in subsequent stages due to varying resource availability. Our proposed method can mask the time required to perform speculative operations by overlapping with other ongoing operations. We introduce the notion of “open-option” prefetching, which, via coding techniques, allows speculative prefetching to begin even before knowing exactly which input will be needed. On a compute cloud testbed, we apply speculative pipelining to the Hadoop sorting benchmark and show that sorting time is shortened significantly.
Keywords
cloud computing; storage management; compute cloud programming; pipeline-based scheduling strategy; speculative pipelining; speculative prefetching; Availability; Bandwidth; Clouds; Delay; Pipeline processing; Prefetching; Programming;
fLanguage
English
Publisher
ieee
Conference_Titel
MILITARY COMMUNICATIONS CONFERENCE, 2010 - MILCOM 2010
Conference_Location
San Jose, CA
ISSN
2155-7578
Print_ISBN
978-1-4244-8178-1
Type
conf
DOI
10.1109/MILCOM.2010.5680451
Filename
5680451
Link To Document