• DocumentCode
    3500779
  • Title

    Research and application of cluster analysis algorithm

  • Author

    Hailong Chen ; Chunli Liu

  • Author_Institution
    Sch. of Comput. Sci. & Technol., Harbin Univ. of Sci. & Technol., Harbin, China
  • Volume
    01
  • fYear
    2013
  • fDate
    16-18 Aug. 2013
  • Firstpage
    575
  • Lastpage
    579
  • Abstract
    With the popularity of database technology matures and data applications, the amount of data accumulated by the human increases rapidly. Facing the extremely large amount of data, we gradually step into a “rich data, poor knowledge” embarrassing situation. The data mining (Data Mining) rise to solve this problem. In this paper, we study the means and methods of clustering analysis that processing data partition or grouping, which is an important field in data mining. Based on the understanding of theoretical basis of clustering analysis, firstly, analyze in detail main algorithms of partitioning methods, hierarchical methods, density-based methods, grid-based methods and model-based methods. Secondly, compare performance of different clustering algorithms from scalability, the shape of cluster, sensitivity to the “noise”, and sensitivity to the data input sequence, high dimension and algorithm efficiency. Finally, use MATLAB for simulating and verifying applications of the algorithms based on K-means clustering analysis and hierarchical clustering.
  • Keywords
    data mining; database management systems; formal verification; pattern clustering; statistical analysis; K-means clustering analysis algorithm; MATLAB; data accumulation; data applications; data mining; data partition processing; database technology; density-based methods; grid-based methods; hierarchical clustering methods; model-based methods; multivariate statistical method; partitioning methods; Adaptation models; Algorithm design and analysis; Analytical models; Clustering algorithms; Data models; Databases; Educational institutions; Clustering Analysis; Data Mining; Hierarchical Methods; K-means;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Measurement, Information and Control (ICMIC), 2013 International Conference on
  • Conference_Location
    Harbin
  • Print_ISBN
    978-1-4799-1390-9
  • Type

    conf

  • DOI
    10.1109/MIC.2013.6758030
  • Filename
    6758030