DocumentCode :
63450
Title :
Scalable Constrained Spectral Clustering
Author :
Jianyuan Li ; Yingjie Xia ; Zhenyu Shan ; Yuncai Liu
Author_Institution :
Coll. of Comput. Sci., Zhejiang Univ., Hangzhou, China
Volume :
27
Issue :
2
fYear :
2015
fDate :
Feb. 1 2015
Firstpage :
589
Lastpage :
593
Abstract :
Constrained spectral clustering (CSC) algorithms have shown great promise in significantly improving clustering accuracy by encoding side information into spectral clustering algorithms. However, existing CSC algorithms are inefficient in handling moderate and large datasets. In this paper, we aim to develop a scalable and efficient CSC algorithm by integrating sparse coding based graph construction into a framework called constrained normalized cuts. To this end, we formulate a scalable constrained normalized-cuts problem and solve it based on a closed-form mathematical analysis. We demonstrate that this problem can be reduced to a generalized eigenvalue problem that can be solved very efficiently. We also describe a principled k-way CSC algorithm for handling moderate and large datasets. Experimental results over benchmark datasets demonstrate that the proposed algorithm is greatly cost-effective, in the sense that (1) with less side information, it can obtain significant improvements in accuracy compared to the unsupervised baseline; (2) with less computational time, it can achieve high clustering accuracies close to those of the state-of-the-art.
Keywords :
eigenvalues and eigenfunctions; graph theory; pattern clustering; benchmark datasets; closed-form mathematical analysis; constrained normalized-cuts problem; constrained spectral clustering; generalized eigenvalue problem; principled k-way CSC algorithm; side information encoding; sparse coding based graph construction; Accuracy; Algorithm design and analysis; Clustering algorithms; Eigenvalues and eigenfunctions; Encoding; Sparse matrices; Vectors; Constrained spectral clustering; efficiency; scalability; sparse coding;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/TKDE.2014.2356471
Filename :
6895121
Link To Document :
بازگشت