• DocumentCode
    967931
  • Title

    Semisupervised Clustering with Metric Learning using Relative Comparisons

  • Author

    Kumar, Nimit ; Kummamuru, Krishna

  • Author_Institution
    Gridstone Res., Mumbai
  • Volume
    20
  • Issue
    4
  • fYear
    2008
  • fDate
    4/1/2008 12:00:00 AM
  • Firstpage
    496
  • Lastpage
    503
  • Abstract
    Semisupervised clustering algorithms partition a given data set using limited supervision from the user. The success of these algorithms depends on the type of supervision and also on the kind of dissimilarity measure used while creating partitions of the space. This paper proposes a clustering algorithm that uses supervision in terms of relative comparisons, viz., x is closer to y than to z. The proposed clustering algorithm simultaneously learns the underlying dissimilarity measure while finding compact clusters in the given data set using relative comparisons. Through our experimental studies on high-dimensional textual data sets, we demonstrate that the proposed algorithm achieves higher accuracy and is more robust than similar algorithms using pairwise constraints for supervision.
  • Keywords
    learning (artificial intelligence); pattern clustering; high-dimensional textual data sets; metric learning; semisupervised clustering; space partitions; Clustering; Constraint-based Clustering; Dissimilarity Measures; Semi-supervised learning;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2007.190715
  • Filename
    4378399