• DocumentCode
    1560323
  • Title

    Information theoretic clustering

  • Author

    Gokcay, Erhan ; Principe, Jose C.

  • Author_Institution
    Computational NeuroBiology Lab, Salk Inst., La Jolla, CA, USA
  • Volume
    24
  • Issue
    2
  • fYear
    2002
  • fDate
    2/1/2002 12:00:00 AM
  • Firstpage
    158
  • Lastpage
    171
  • Abstract
    Clustering is an important topic in pattern recognition. Since only the structure of the data dictates the grouping (unsupervised learning), information theory is an obvious criteria to establish the clustering rule. The paper describes a novel valley seeking clustering algorithm using an information theoretic measure to estimate the cost of partitioning the data set. The information theoretic criteria developed here evolved from a Renyi entropy estimator (A. Renyi, 1960) that was proposed recently and has been successfully applied to other machine learning applications (J.C. Principe et al., 2000). An improved version of the k-change algorithm is used in optimization because of the stepwise nature of the cost function and existence of local minima. Even when applied to nonlinearly separable data, the new algorithm performs well, and was able to find nonlinear boundaries between clusters. The algorithm is also applied to the segmentation of magnetic resonance imaging data (MRI) with very promising results
  • Keywords
    bibliographies; data handling; image segmentation; information theory; magnetic resonance imaging; optimisation; pattern clustering; unsupervised learning; MRI segmentation; Renyi entropy estimator; clustering rule; cost function; data set partitioning; information theoretic clustering; information theoretic criteria; information theoretic measure; k-change algorithm; local minima; machine learning applications; magnetic resonance imaging data segmentation; nonlinear boundaries; nonlinearly separable data; optimization; pattern recognition; stepwise nature; unsupervised learning; valley seeking clustering algorithm; Clustering algorithms; Costs; Entropy; Estimation theory; Information theory; Machine learning algorithms; Magnetic resonance imaging; Partitioning algorithms; Pattern recognition; Unsupervised learning;
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/34.982897
  • Filename
    982897