• DocumentCode
    2593613
  • Title

    Learning Pairwise Similarity for Data Clustering

  • Author

    Fred, Ana L N ; Jain, Anil K.

  • Author_Institution
    Inst. de Telecomunicacoes, Inst. Superior Tecnico, Lisbon
  • Volume
    1
  • fYear
    0
  • fDate
    0-0 0
  • Firstpage
    925
  • Lastpage
    928
  • Abstract
    Each clustering algorithm induces a similarity between given data points, according to the underlying clustering criteria. Given the large number of available clustering techniques, one is faced with the following questions: (a) Which measure of similarity should be used in a given clustering problem? (b) Should the same similarity measure be used throughout the d-dimensional feature space? In other words, are the underlying clusters in given data of similar shape? Our goal is to learn the pairwise similarity between points in order to facilitate a proper partitioning of the data without the a priori knowledge of k, the number of clusters, and of the shape of these clusters. We explore a clustering ensemble approach combined with cluster stability criteria to selectively learn the similarity from a collection of different clustering algorithms with various parameter configurations
  • Keywords
    learning (artificial intelligence); pattern clustering; cluster stability criteria; clustering ensemble approach; data clustering; data partitioning; pairwise similarity learning; similarity measure; Clustering algorithms; Clustering methods; Computer science; Data engineering; Extraterrestrial measurements; Partitioning algorithms; Robustness; Shape; Stability criteria; Telecommunications;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition, 2006. ICPR 2006. 18th International Conference on
  • Conference_Location
    Hong Kong
  • ISSN
    1051-4651
  • Print_ISBN
    0-7695-2521-0
  • Type

    conf

  • DOI
    10.1109/ICPR.2006.754
  • Filename
    1699041