• DocumentCode
    584287
  • Title

    Research on Clustering Algorithm for Massive Data Based on Hadoop Platform

  • Author

    Zhengqiao, Xu ; Dewei, Zhao

  • Author_Institution
    Exp. Center, China West Normal Univ., Nanchong, China
  • fYear
    2012
  • fDate
    11-13 Aug. 2012
  • Firstpage
    43
  • Lastpage
    45
  • Abstract
    With the concepts of cloud computing springing up, the researches of data mining clustering algorithm which is based on cloud computing become a research focus for scholars both at home and abroad. This article aiming at the extensive data clustering problem, using cloud computing technology, according to Hadoop platform does a deep research based on cloud computing platforms Hadoop and parallel K-means clustering algorithm. And it puts forward a kind of mass data clustering model based on Hadoop and new ideas of parallel K-means algorithm.
  • Keywords
    cloud computing; data mining; parallel algorithms; pattern clustering; public domain software; Hadoop platform; cloud computing technology; data mining clustering algorithm; mass data clustering model; parallel K-means clustering algorithm; Algorithm design and analysis; Cloud computing; Clustering algorithms; Computational modeling; Computers; Data mining; Educational institutions;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Science & Service System (CSSS), 2012 International Conference on
  • Conference_Location
    Nanjing
  • Print_ISBN
    978-1-4673-0721-5
  • Type

    conf

  • DOI
    10.1109/CSSS.2012.19
  • Filename
    6394257