DocumentCode :
3584940
Title :
Hadoop in OpenStack: Data-location-aware cluster provisioning
Author :
Thaha, Asmath F. ; Singh, Manvir ; Amin, Anang H. M. ; Ahmad, Nazrul M. ; Kannan, Subarmaniam
Author_Institution :
Thunder Cloud Res. Lab., Multimedia Univ. Melaka, Melaka, Malaysia
fYear :
2014
Firstpage :
296
Lastpage :
301
Abstract :
Nowadays, cloud based analytics platforms are replacing traditional physical clusters due to the high efficiency it provides. Such cloud platforms runs Hadoop on virtual clusters with remotely attached storage. In cloud architecture with multiple geographically separated regions, virtual machines (VMs) belonging to a virtual cluster are placed randomly. In order to run MapReduce jobs, data have to be moved to the regions where the VMs reside to achieve data locality. In this paper, we propose a data-location aware virtual cluster provisioning strategy to identify the data location and provision the cluster near to the storage. The use of bio-inspired optimization algorithms are considered for optimizing the placements of VMs. Data location aware cluster provisioning reduces the network distance between storage and the virtual cluster, resulting in faster job completion times.
Keywords :
cloud computing; optimisation; parallel programming; pattern clustering; virtual machines; Hadoop; MapReduce jobs; OpenStack; VM placement optimization; bio-inspired optimization algorithms; cloud architecture; cloud based analytics platforms; cloud platforms; data locality; data location aware cluster provisioning; data location identification; data provision identification; data storage; data-location aware virtual cluster provisioning strategy; data-location-aware cluster provisioning; job completion times; multiple geographically separated regions; network distance reduction; remotely attached storage; virtual cluster; virtual machines; Availability; Bandwidth; Cloud computing; Computer architecture; Distributed databases; Hardware; Switches; Bio-inspired algorithms; Cloud Computing; Cluster Provisioning; Data locality; Hadoop; MapReduce; OpenStack;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information and Communication Technologies (WICT), 2014 Fourth World Congress on
Print_ISBN :
978-1-4799-8114-4
Type :
conf
DOI :
10.1109/WICT.2014.7077282
Filename :
7077282
Link To Document :
بازگشت