• DocumentCode
    1362448
  • Title

    Learning Bregman Distance Functions for Semi-Supervised Clustering

  • Author

    Wu, Lei ; Hoi, Steven C H ; Jin, Rong ; Zhu, Jianke ; Yu, Nenghai

  • Author_Institution
    Univ. of Sci. & Technol. of China, Hefei, China
  • Volume
    24
  • Issue
    3
  • fYear
    2012
  • fDate
    3/1/2012 12:00:00 AM
  • Firstpage
    478
  • Lastpage
    491
  • Abstract
    Learning distance functions with side information plays a key role in many data mining applications. Conventional distance metric learning approaches often assume that the target distance function is represented in some form of Mahalanobis distance. These approaches usually work well when data are in low dimensionality, but often become computationally expensive or even infeasible when handling high-dimensional data. In this paper, we propose a novel scheme of learning nonlinear distance functions with side information. It aims to learn a Bregman distance function using a nonparametric approach that is similar to Support Vector Machines. We emphasize that the proposed scheme is more general than the conventional approach for distance metric learning, and is able to handle high-dimensional data efficiently. We verify the efficacy of the proposed distance learning method with extensive experiments on semi-supervised clustering. The comparison with state-of-the-art approaches for learning distance functions with side information reveals clear advantages of the proposed technique.
  • Keywords
    learning (artificial intelligence); nonlinear functions; pattern clustering; Bregman distance function; data mining; distance metric learning; high-dimensional data handling; nonlinear distance function learning; semisupervised clustering; side information; support vector machines; Clustering algorithms; Convex functions; Kernel; Linear matrix inequalities; Measurement; Training; Training data; Bregman distance; convex functions.; distance functions; metric learning;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2010.215
  • Filename
    5611527