DocumentCode :
3406198
Title :
Deploying and researching Hadoop in virtual machines
Author :
Xu, Guanghui ; Xu, Feng ; Ma, Hongxu
Author_Institution :
Coll. of Comput. & Inf., Hohai Univ., Nanjing, China
fYear :
2012
fDate :
15-17 Aug. 2012
Firstpage :
395
Lastpage :
399
Abstract :
Hadoop´s emerging and the maturity of virtualization make it feasible to combine them together to process immense data set. To do research on Hadoop in virtual environment, an experimental environment is needed. This paper firstly introduces some technologies used such as CloudStack, MapReduce and Hadoop. Based on that, a method to deploy CloudStack is given. Then we discuss how to deploy Hadoop in virtual machines which can be obtained from CloudStack by some means, then an algorithm to solve the problem that all the virtual machines which are created by CloudStack using same template have a same hostname. After that we run some Hadoop programs under the virtual cluster, which shows that it is feasible to deploying Hadoop in this way. Then some methods to optimize Hadoop in virtual machines are discussed. From this paper, readers can follow it to set up their own Hadoop experimental environment and capture the current status and trend of optimizing Hadoop in virtual environment.
Keywords :
cloud computing; public domain software; virtual machines; virtualisation; CloudStack; Hadoop experimental environment; Hadoop programs; MapReduce; immense data set processing; virtual cluster; virtual environment; virtual machines; virtualization; Cloud computing; Java; Programming; Servers; Virtual machining; CloudStack; Hadoop; MapReduce; Virtualization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Automation and Logistics (ICAL), 2012 IEEE International Conference on
Conference_Location :
Zhengzhou
ISSN :
2161-8151
Print_ISBN :
978-1-4673-0362-0
Electronic_ISBN :
2161-8151
Type :
conf
DOI :
10.1109/ICAL.2012.6308241
Filename :
6308241
Link To Document :
بازگشت