Title :
In STechAH: An Autoscaling Scheme for Hadoop in the Private Cloud
Author :
Xueying Wang ; Zhihui Lu ; Jie Wu ; Tong Zhao ; Patrick Hung
Author_Institution :
Sch. of Comput. Sci., Fudan Univ. Shanghai, Shanghai, China
Abstract :
Research shows that in many cloud data centers, physical resources are not used efficiently and thereby cost extra overhead. To improve cost-effectiveness of resources in cloud data centers, running big data applications to share residual capacity is a practical solution. However, performance loss brought by resource competition and interference from different types of applications is the main challenge for us. In this paper, we design, implement and evaluate the InSTechAH, an auto scaling scheme for a Hadoop system in a private cloud, which attempts to improve the resource utilization in cloud data centers as well as to maintain required quality of services by auto scaling and scheduling background analytics tasks. In this system, we design the multilayer node model to reduce interference from other services by automatically scaling the clusters according to the auto scale algorithm we introduced. We then build the resource scheduling model which use prediction based scheduling method to reduce the cost brought by scaling. We evaluate our scheme partly on a real data trace and partly on simulation, with Hadoop as the parallel data analytics frameworks and Open Stack as the cloud management architecture, to show the efficiency of InSTechAH system.
Keywords :
Big Data; cloud computing; computer centres; cost reduction; parallel processing; quality of service; scheduling; Hadoop system; InSTechAH system; Open Stack; auto scale algorithm; auto scaling scheme; autoscaling scheme; background analytics task; big data application; cloud data center; cloud management architecture; cost reduction; multilayer node model; parallel data analytics frameworks; prediction based scheduling method; private cloud; quality of service; residual capacity; resource competition; resource scheduling model; resource utilization; Cloud computing; Clustering algorithms; Data analysis; Monitoring; Predictive models; Resource management; Servers; Cloud computing; Hadoop; Openstack; Private Cloud; autoscale; cost effectiveness; datacenter;
Conference_Titel :
Services Computing (SCC), 2015 IEEE International Conference on
Conference_Location :
New York, NY
Print_ISBN :
978-1-4673-7280-0
DOI :
10.1109/SCC.2015.61