DocumentCode
3093957
Title
Inverse-degree Sampling for Spectral Clustering
Author
Gao, Haidong ; Zhuang, Yueting ; Wu, Fei ; Shao, Jian
Author_Institution
Coll. of Comput. Sci., Zhejiang Univ., Hang Zhou, China
fYear
2011
fDate
12-15 Aug. 2011
Firstpage
362
Lastpage
367
Abstract
Among those classical clustering algorithms, spectral clustering performs much better than K-means in most cases. However, for the sake of cubic time complexity, spectral clustering is hardly used for clustering large-scale data sets. Therefore, sampling-based methods such as Nystrom method and Column sampling are respectively conducted as potential approaches to tackle this challenge. As we know, current sampling-based methods often utilize the uniform or other random sampling policies to select representative data and tend to disregard the data in small size clusters. This paper proposes an unbiased sampling framework, derives a new sampling method called inverse-degree sampling and then introduces an entropy criterion to prove it in theory simply. According to the selection of representative data by inverse-degree sampling in spectral clustering, the time complexity of spectral clustering becomes quadratic. Experiments on both toy data and real-world data demonstrate both the good sampling performance and the comparable clustering quality.
Keywords
computational complexity; entropy; pattern clustering; sampling methods; Nystrom method; column sampling; cubic time complexity; entropy criterion; inverse-degree sampling; large-scale data sets clustering; random sampling policy; sampling-based method; spectral clustering; unbiased sampling framework; uniform sampling policy; Accuracy; Approximation methods; Clustering algorithms; Entropy; Kernel; Matrix decomposition; Sampling methods; degree; sampling; spectral clustering;
fLanguage
English
Publisher
ieee
Conference_Titel
Image and Graphics (ICIG), 2011 Sixth International Conference on
Conference_Location
Hefei, Anhui
Print_ISBN
978-1-4577-1560-0
Electronic_ISBN
978-0-7695-4541-7
Type
conf
DOI
10.1109/ICIG.2011.54
Filename
6005587
Link To Document