• DocumentCode
    3124378
  • Title

    A Robust Clustering Algorithm Based on Aggregated Heat Kernel Mapping

  • Author

    Huang, Hao ; Yoo, Shinjae ; Qin, Hong ; Yu, Dantong

  • Author_Institution
    Dept. of Comput. Sci., Stony Brook Univ., Stony Brook, NY, USA
  • fYear
    2011
  • fDate
    11-14 Dec. 2011
  • Firstpage
    270
  • Lastpage
    279
  • Abstract
    Current spectral clustering algorithms suffer from both sensitivity to scaling parameter selection in similarity matrix construction, and data perturbation. This paper aims to improve robustness in clustering algorithms and combat these two limitations based on heat kernel theory. Heat kernel can statistically depict traces of random walk, so it has an intrinsic connection with diffusion distance, with which we can ensure robustness during any clustering process. By integrating heat distributed along time scale, we propose a novel method called Aggregated Heat Kernel (AHK) to measure the distance between each point pair in their eigen space. Using AHK and Laplace-Beltrami Normalization (LBN) we are able to apply an advanced noise-resisting robust spectral mapping to original dataset. Moreover it offers stability on scaling parameter tuning. Experimental results show that, compared to other popular spectral clustering methods, our algorithm can achieve robust clustering results on both synthetic and UCI real datasets.
  • Keywords
    data mining; matrix algebra; pattern clustering; unsupervised learning; Laplace-Beltrami normalization; aggregated heat kernel mapping; data mining; data perturbation; diffusion distance; distance measurement; intrinsic connection; knowledge discovery; noise-resisting robust spectral mapping; robust clustering algorithm; scaling parameter selection; sensitivity parameter selection; similarity matrix construction; spectral clustering algorithms; unsupervised knowledge exploration; Clustering algorithms; Heating; Kernel; Laplace equations; Noise; Robustness; Sensitivity; Diffusion processes; Green´s function methods; Spectral analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining (ICDM), 2011 IEEE 11th International Conference on
  • Conference_Location
    Vancouver,BC
  • ISSN
    1550-4786
  • Print_ISBN
    978-1-4577-2075-8
  • Type

    conf

  • DOI
    10.1109/ICDM.2011.15
  • Filename
    6137231