• DocumentCode
    3315433
  • Title

    A k-Means-Based Projected Clustering Algorithm

  • Author

    Sun, Yufen ; Liu, Gang ; Xu, Kun

  • Author_Institution
    Coll. of Comput. Sci. & Technol., Wuhan Univ. of Technol., Wuhan, China
  • Volume
    1
  • fYear
    2010
  • fDate
    28-31 May 2010
  • Firstpage
    466
  • Lastpage
    470
  • Abstract
    In high dimensional data space, clusters are likely to exist in different subspaces. K-means is a classic clustering algorithm, but it cannot be used to find subspace clusters. In this paper, an algorithm called GKM is designed to generalize k-means algorithm for high dimensional data. In the objective function of GKM, we associate a weight vector with each cluster to indicate which dimensions are relevant to this cluster. To prevent the value of the objective function from decreasing because of the elimination of dimensions, virtual dimensions are added to the objective function. The values of data points on virtual dimensions are set artificially to ensure that the objective function is minimized when the real subspace clusters or the clusters in original space are found. Algorithm GKM preserves the advantages of k-means. It can identify subspace clusters with linear time complexity. Our performance study with a synthetic dataset and a real dataset demonstrates the efficiency and effectiveness of GKM.
  • Keywords
    data mining; pattern clustering; generalize k-means algorithm; k-means clustering; objective function; projected clustering algorithm; virtual dimensions; weight vector; Algorithm design and analysis; Clustering algorithms; Data mining; Educational institutions; Intelligent transportation systems; Iterative algorithms; Machine learning algorithms; Partitioning algorithms; Space technology; Sun; data mining; high dimensions; k-means; projected clustering;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Science and Optimization (CSO), 2010 Third International Joint Conference on
  • Conference_Location
    Huangshan, Anhui
  • Print_ISBN
    978-1-4244-6812-6
  • Electronic_ISBN
    978-1-4244-6813-3
  • Type

    conf

  • DOI
    10.1109/CSO.2010.119
  • Filename
    5533167