Title :
Elastic Allocator: An Adaptive Task Scheduler for Streaming Query in the Cloud
Author :
Zheng Han ; Rui Chu ; Haibo Mi ; Huaimin Wang
Author_Institution :
Sci. & Technol. on Parallel & Distrib. Process. Lab., Nat. Univ. of Defense Technol., Changsha, China
Abstract :
Many big data applications receive and process data in real time. These data, also known as data streams, are generated continuously and processed online in a low latency manner. Data stream is prone to change dramatically in volume, since its workload may have a variation of several orders between peak and valley periods. Fully provisioning resources for stream processing to handle the peak load is costly, while over-provisioning is wasteful when to deal with lightweight workload. Cloud computing emphasizes that resource should be utilized economically and elastically. An open question is how to allocate query task adaptively to keeping up the input rate of the data stream. Previous work focuses on using either local or global capacity information to improve the cluster CPU resource utilization, while the bandwidth utilization which is also critical to the system throughput is ignored or simplified. In this paper, we formalize the operator placement problem considering both the CPU and bandwidth usage, and introduce the Elastic Allocator. The Elastic Allocator uses a quantitative method to evaluate a node´s capacity and bandwidth usage, and exploit both the local and global resource information to allocate the query task in a graceful manner to achieve high resource utilization. The experimental results and a simple prototype built on top of Storm finally demonstrate that Elastic Allocator is adaptive and feasible in cloud computing environment, and has an advantage of improving and balancing system resource utilization.
Keywords :
Big Data; cloud computing; query processing; resource allocation; Big Data applications; CPU; adaptive task scheduler; bandwidth usage; cloud computing; data streams; elastic allocator; global resource information; local resource information; node capacity; operator placement problem; quantitative method; query streaming; query task allocation; resource utilization; Bandwidth; Cloud computing; Clustering algorithms; Computer architecture; Linear programming; Resource management; Storms; data stream; elastic; stream query; task allocation;
Conference_Titel :
Service Oriented System Engineering (SOSE), 2014 IEEE 8th International Symposium on
Conference_Location :
Oxford
DOI :
10.1109/SOSE.2014.40