Learning Pairwise Similarity for Data Clustering

Author

Fred, Ana L N ; Jain, Anil K.

Author_Institution

Inst. de Telecomunicacoes, Inst. Superior Tecnico, Lisbon

Volume

1

fYear

0

fDate

0-0 0

Firstpage

925

Lastpage

928

Abstract

Each clustering algorithm induces a similarity between given data points, according to the underlying clustering criteria. Given the large number of available clustering techniques, one is faced with the following questions: (a) Which measure of similarity should be used in a given clustering problem? (b) Should the same similarity measure be used throughout the d-dimensional feature space? In other words, are the underlying clusters in given data of similar shape? Our goal is to learn the pairwise similarity between points in order to facilitate a proper partitioning of the data without the a priori knowledge of k, the number of clusters, and of the shape of these clusters. We explore a clustering ensemble approach combined with cluster stability criteria to selectively learn the similarity from a collection of different clustering algorithms with various parameter configurations

Keywords

learning (artificial intelligence); pattern clustering; cluster stability criteria; clustering ensemble approach; data clustering; data partitioning; pairwise similarity learning; similarity measure; Clustering algorithms; Clustering methods; Computer science; Data engineering; Extraterrestrial measurements; Partitioning algorithms; Robustness; Shape; Stability criteria; Telecommunications;

fLanguage

English

Publisher

ieee

Conference_Titel

Pattern Recognition, 2006. ICPR 2006. 18th International Conference on

Conference_Location

Hong Kong

ISSN

1051-4651

Print_ISBN

0-7695-2521-0

Type

conf

DOI

10.1109/ICPR.2006.754

Filename

1699041