Title :
Multi-view clustering
Author :
Bickel, Steffen ; Scheffer, Tobias
Author_Institution :
Dept. of Comput. Sci., Humboldt-Univ. zu Berlin, Germany
Abstract :
We consider clustering problems in which the available attributes can be split into two independent subsets, such that either subset suffices for learning. Example applications of this multi-view setting include clustering of Web pages which have an intrinsic view (the pages themselves) and an extrinsic view (e.g., anchor texts of inbound hyperlinks); multi-view learning has so far been studied in the context of classification. We develop and study partitioning and agglomerative, hierarchical multi-view clustering algorithms for text data. We find empirically that the multi-view versions of k-means and EM greatly improve on their single-view counterparts. By contrast, we obtain negative results for agglomerative hierarchical multi-view clustering. Our analysis explains this surprising phenomenon.
Keywords :
data mining; learning (artificial intelligence); pattern classification; pattern clustering; set theory; text analysis; Web pages; agglomerative hierarchical multiview clustering; clustering algorithm; independent subsets; multiview learning; partitioning; text data; Data mining;
Conference_Titel :
Data Mining, 2004. ICDM '04. Fourth IEEE International Conference on
Print_ISBN :
0-7695-2142-8
DOI :
10.1109/ICDM.2004.10095