Title :
Research on Clustering Algorithm for Massive Data Based on Hadoop Platform
Author :
Zhengqiao, Xu ; Dewei, Zhao
Author_Institution :
Exp. Center, China West Normal Univ., Nanchong, China
Abstract :
With the concepts of cloud computing springing up, the researches of data mining clustering algorithm which is based on cloud computing become a research focus for scholars both at home and abroad. This article aiming at the extensive data clustering problem, using cloud computing technology, according to Hadoop platform does a deep research based on cloud computing platforms Hadoop and parallel K-means clustering algorithm. And it puts forward a kind of mass data clustering model based on Hadoop and new ideas of parallel K-means algorithm.
Keywords :
cloud computing; data mining; parallel algorithms; pattern clustering; public domain software; Hadoop platform; cloud computing technology; data mining clustering algorithm; mass data clustering model; parallel K-means clustering algorithm; Algorithm design and analysis; Cloud computing; Clustering algorithms; Computational modeling; Computers; Data mining; Educational institutions;
Conference_Titel :
Computer Science & Service System (CSSS), 2012 International Conference on
Conference_Location :
Nanjing
Print_ISBN :
978-1-4673-0721-5
DOI :
10.1109/CSSS.2012.19