DocumentCode
1669919
Title
In STechAH: An Autoscaling Scheme for Hadoop in the Private Cloud
Author
Xueying Wang ; Zhihui Lu ; Jie Wu ; Tong Zhao ; Patrick Hung
Author_Institution
Sch. of Comput. Sci., Fudan Univ. Shanghai, Shanghai, China
fYear
2015
Firstpage
395
Lastpage
402
Abstract
Research shows that in many cloud data centers, physical resources are not used efficiently and thereby cost extra overhead. To improve cost-effectiveness of resources in cloud data centers, running big data applications to share residual capacity is a practical solution. However, performance loss brought by resource competition and interference from different types of applications is the main challenge for us. In this paper, we design, implement and evaluate the InSTechAH, an auto scaling scheme for a Hadoop system in a private cloud, which attempts to improve the resource utilization in cloud data centers as well as to maintain required quality of services by auto scaling and scheduling background analytics tasks. In this system, we design the multilayer node model to reduce interference from other services by automatically scaling the clusters according to the auto scale algorithm we introduced. We then build the resource scheduling model which use prediction based scheduling method to reduce the cost brought by scaling. We evaluate our scheme partly on a real data trace and partly on simulation, with Hadoop as the parallel data analytics frameworks and Open Stack as the cloud management architecture, to show the efficiency of InSTechAH system.
Keywords
Big Data; cloud computing; computer centres; cost reduction; parallel processing; quality of service; scheduling; Hadoop system; InSTechAH system; Open Stack; auto scale algorithm; auto scaling scheme; autoscaling scheme; background analytics task; big data application; cloud data center; cloud management architecture; cost reduction; multilayer node model; parallel data analytics frameworks; prediction based scheduling method; private cloud; quality of service; residual capacity; resource competition; resource scheduling model; resource utilization; Cloud computing; Clustering algorithms; Data analysis; Monitoring; Predictive models; Resource management; Servers; Cloud computing; Hadoop; Openstack; Private Cloud; autoscale; cost effectiveness; datacenter;
fLanguage
English
Publisher
ieee
Conference_Titel
Services Computing (SCC), 2015 IEEE International Conference on
Conference_Location
New York, NY
Print_ISBN
978-1-4673-7280-0
Type
conf
DOI
10.1109/SCC.2015.61
Filename
7207379
Link To Document