• DocumentCode
    1245653
  • Title
    Simulated annealing using a reversible jump Markov chain Monte Carlo algorithm for fuzzy clustering
  • Author
    Bandyopadhyay, Sanghamitra
  • Author_Institution
    Machine Intelligence Unit, Indian Stat. Inst., Kolkata, India
  • Volume
    17
  • Issue
    4
  • fYear
    2005
  • fDate
    4/1/2005 12:00:00 AM
  • Firstpage
    479
  • Lastpage
    490
  • Abstract
    In this paper, an approach is proposed for automatically clustering a data set into a number of fuzzy partitions using simulated annealing with a reversible jump Markov chain Monte Carlo algorithm. This is in contrast to the widely used fuzzy clustering scheme, the fuzzy c-means (FCM) algorithm, which requires a priori knowledge of the number of clusters. The approach performs the clustering by optimizing a cluster validity index, the Xie-Beni index. It uses the homogeneous reversible jump Markov chain Monte Carlo (RJMCMC) kernel as the proposal so that the algorithm can jump between different dimensions, i.e., numbers of clusters, until the correct value is obtained. Different moves, namely birth, death, split, merge, and update, are used for sampling a candidate state given the current state. The effectiveness of the proposed technique in optimizing the Xie-Beni index, and thereby determining the appropriate clustering, is demonstrated for both artificial and real-life data sets. In a part of the investigation, the utility of the fuzzy clustering scheme for classifying pixels in an IRS satellite image of Kolkata is studied. A technique for reducing the computational effort in the case of satellite image data is incorporated.
  • Keywords
    Markov processes; Monte Carlo methods; pattern clustering; simulated annealing; unsupervised learning; cluster validity index; fuzzy clustering; pattern recognition; remote sensing; reversible jump Markov chain Monte Carlo algorithm; satellite image data; Clustering algorithms; Fuzzy sets; Kernel; Partitioning algorithms; Pixel; Proposals; Sampling methods; Satellites; Reversible Jump Markov Chain Monte Carlo; determining the number of clusters
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1041-4347
  • Type
    jour
  • DOI
    10.1109/TKDE.2005.64
  • Filename
    1401888
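
  • Illustration
    The abstract names the Xie-Beni index as the criterion being optimized. As a minimal sketch, and not the paper's implementation, the index can be computed as the membership-weighted within-cluster scatter divided by n times the smallest squared distance between two cluster centers. The function name `xie_beni`, its argument layout, and the exposed fuzzifier `m` are assumptions made for illustration; m = 2 corresponds to the original Xie-Beni definition. Lower values indicate more compact, better separated clusters.

```python
def xie_beni(points, centers, memberships, m=2.0):
    """Xie-Beni validity index (illustrative sketch, hypothetical signature).

    points:      list of n coordinate tuples
    centers:     list of c coordinate tuples (c >= 2)
    memberships: memberships[i][k] is the degree to which point k
                 belongs to cluster i
    m:           fuzzifier applied to the memberships (assumed default 2)
    """
    def sqdist(a, b):
        # Squared Euclidean distance between two coordinate tuples.
        return sum((x - y) ** 2 for x, y in zip(a, b))

    n = len(points)
    c = len(centers)
    if c < 2:
        raise ValueError("the index needs at least two cluster centers")

    # Numerator: fuzzy within-cluster scatter.
    compactness = sum(
        memberships[i][k] ** m * sqdist(points[k], centers[i])
        for i in range(c)
        for k in range(n)
    )
    # Denominator: n times the minimum squared separation between centers.
    separation = min(
        sqdist(centers[i], centers[j])
        for i in range(c)
        for j in range(i + 1, c)
    )
    return compactness / (n * separation)
```

    Because the denominator depends only on the closest pair of centers, a move that changes the number of clusters (such as the birth, death, split, and merge moves listed in the abstract) can be scored by re-evaluating this function on the candidate state.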