• DocumentCode
    1403874
  • Title

    Pairwise data clustering by deterministic annealing

  • Author

    Hofmann, Thomas ; Buhmann, Joachim M.

  • Author_Institution
    Inst. fur Inf. II, Rheinrich-Wilhelms Univ., Bonn, Germany
  • Volume
    19
  • Issue
    1
  • fYear
    1997
  • fDate
    1/1/1997 12:00:00 AM
  • Firstpage
    1
  • Lastpage
    14
  • Abstract
    Partitioning a data set and extracting hidden structure from the data arises in different application areas of pattern recognition, speech and image processing. Pairwise data clustering is a combinatorial optimization method for data grouping which extracts hidden structure from proximity data. We describe a deterministic annealing approach to pairwise clustering which shares the robustness properties of maximum entropy inference. The resulting Gibbs probability distributions are estimated by mean-field approximation. A new structure-preserving algorithm to cluster dissimilarity data and to simultaneously embed these data in a Euclidian vector space is discussed which can be used for dimensionality reduction and data visualization. The suggested embedding algorithm which outperforms conventional approaches has been implemented to analyze dissimilarity data from protein analysis and from linguistics. The algorithm for pairwise data clustering is used to segment textured images
  • Keywords
    image processing; inference mechanisms; maximum entropy methods; pattern recognition; probability; simulated annealing; speech processing; Euclidian vector space; Gibbs probability distributions; combinatorial optimization method; data grouping; data set partitioning; data visualization; deterministic annealing; dimensionality reduction; dissimilarity data clustering; hidden structure extraction; image processing; linguistics; maximum entropy inference; mean-field approximation; pairwise data clustering; pattern recognition; protein analysis; robustness; speech processing; structure-preserving algorithm; textured image segmentation; Algorithm design and analysis; Annealing; Clustering algorithms; Data analysis; Data mining; Image processing; Inference algorithms; Optimization methods; Pattern recognition; Speech processing;
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/34.566806
  • Filename
    566806