DocumentCode :
2579627
Title :
DataCloud: An Efficient Massive Data Mining and Analysis Framework on Large Clusters
Author :
Zhang, Guigang ; Li, Chao ; Zhang, Yong ; Xing, Chunxiao
Author_Institution :
Tsinghua Nat. Lab. for Inf. Sci. & Technol., Tsinghua Univ., Beijing, China
fYear :
2012
fDate :
16-18 Nov. 2012
Firstpage :
198
Lastpage :
203
Abstract :
With the development of cloud computing technologies, big data processing is becoming increasingly important, and mining and analyzing massive data poses a major challenge. In this paper, we propose DataCloud, an efficient massive data mining and analysis framework for large clusters. The most important part of DataCloud is Rabbit, a processing-plan framework for massive data mining and analysis on large clusters, similar to Pig and Hive. We present a detailed analysis of the Rabbit plan.
Keywords :
cloud computing; data mining; DataCloud; Rabbit plan; analysis processing plan framework; data cloud computing technology; data processing; massive data mining; Companies; Computational modeling; Data mining; Distributed databases; Educational institutions; Rabbits; Cloud computing; DataCloud; Massive data management; Rabbit;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Web Information Systems and Applications Conference (WISA), 2012 Ninth
Conference_Location :
Haikou
Print_ISBN :
978-1-4673-3054-1
Type :
conf
DOI :
10.1109/WISA.2012.26
Filename :
6385210