DocumentCode
2593613
Title
Learning Pairwise Similarity for Data Clustering
Author
Fred, Ana L N ; Jain, Anil K.
Author_Institution
Inst. de Telecomunicacoes, Inst. Superior Tecnico, Lisbon
Volume
1
fYear
0
fDate
0-0 0
Firstpage
925
Lastpage
928
Abstract
Each clustering algorithm induces a similarity between given data points, according to the underlying clustering criteria. Given the large number of available clustering techniques, one is faced with the following questions: (a) Which measure of similarity should be used in a given clustering problem? (b) Should the same similarity measure be used throughout the d-dimensional feature space? In other words, are the underlying clusters in given data of similar shape? Our goal is to learn the pairwise similarity between points in order to facilitate a proper partitioning of the data without the a priori knowledge of k, the number of clusters, and of the shape of these clusters. We explore a clustering ensemble approach combined with cluster stability criteria to selectively learn the similarity from a collection of different clustering algorithms with various parameter configurations
Keywords
learning (artificial intelligence); pattern clustering; cluster stability criteria; clustering ensemble approach; data clustering; data partitioning; pairwise similarity learning; similarity measure; Clustering algorithms; Clustering methods; Computer science; Data engineering; Extraterrestrial measurements; Partitioning algorithms; Robustness; Shape; Stability criteria; Telecommunications;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition, 2006. ICPR 2006. 18th International Conference on
Conference_Location
Hong Kong
ISSN
1051-4651
Print_ISBN
0-7695-2521-0
Type
conf
DOI
10.1109/ICPR.2006.754
Filename
1699041
Link To Document