Title :
Correlational spectral clustering
Author :
Blaschko, Matthew B. ; Lampert, Christoph H.
Author_Institution :
Max Planck Inst. for Biol. Cybern., Tubingen
Abstract :
We present a new method for spectral clustering with paired data based on kernel canonical correlation analysis, called correlational spectral clustering. Paired data are common in real world data sources, such as images with text captions. Traditional spectral clustering algorithms either assume that data can be represented by a single similarity measure, or by co-occurrence matrices that are then used in biclustering. In contrast, the proposed method uses separate similarity measures for each data representation, and allows for projection of previously unseen data that are only observed in one representation (e.g. images but not text). We show that this algorithm generalizes traditional spectral clustering algorithms and show consistent empirical improvement over spectral clustering on a variety of datasets of images with associated text.
Keywords :
data structures; image processing; pattern clustering; co-occurrence matrices; correlational spectral clustering; data representation; data sources; kernel canonical correlation analysis; Clustering algorithms; Cybernetics; Humans; Kernel; Labeling; Linear discriminant analysis; Spatiotemporal phenomena; Spectral analysis; Testing; Video sequences;
Conference_Titel :
Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on
Conference_Location :
Anchorage, AK
Print_ISBN :
978-1-4244-2242-5
Electronic_ISBN :
1063-6919
DOI :
10.1109/CVPR.2008.4587353