• DocumentCode
    3093957
  • Title

    Inverse-degree Sampling for Spectral Clustering

  • Author

    Gao, Haidong ; Zhuang, Yueting ; Wu, Fei ; Shao, Jian

  • Author_Institution
    Coll. of Comput. Sci., Zhejiang Univ., Hang Zhou, China
  • fYear
    2011
  • fDate
    12-15 Aug. 2011
  • Firstpage
    362
  • Lastpage
    367
  • Abstract
    Among those classical clustering algorithms, spectral clustering performs much better than K-means in most cases. However, for the sake of cubic time complexity, spectral clustering is hardly used for clustering large-scale data sets. Therefore, sampling-based methods such as Nystrom method and Column sampling are respectively conducted as potential approaches to tackle this challenge. As we know, current sampling-based methods often utilize the uniform or other random sampling policies to select representative data and tend to disregard the data in small size clusters. This paper proposes an unbiased sampling framework, derives a new sampling method called inverse-degree sampling and then introduces an entropy criterion to prove it in theory simply. According to the selection of representative data by inverse-degree sampling in spectral clustering, the time complexity of spectral clustering becomes quadratic. Experiments on both toy data and real-world data demonstrate both the good sampling performance and the comparable clustering quality.
  • Keywords
    computational complexity; entropy; pattern clustering; sampling methods; Nystrom method; column sampling; cubic time complexity; entropy criterion; inverse-degree sampling; large-scale data sets clustering; random sampling policy; sampling-based method; spectral clustering; unbiased sampling framework; uniform sampling policy; Accuracy; Approximation methods; Clustering algorithms; Entropy; Kernel; Matrix decomposition; Sampling methods; degree; sampling; spectral clustering;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Image and Graphics (ICIG), 2011 Sixth International Conference on
  • Conference_Location
    Hefei, Anhui
  • Print_ISBN
    978-1-4577-1560-0
  • Electronic_ISBN
    978-0-7695-4541-7
  • Type

    conf

  • DOI
    10.1109/ICIG.2011.54
  • Filename
    6005587