• DocumentCode
    2687749
  • Title

    Genetic programming-based clustering using an information theoretic fitness measure

  • Author

    Boric, Neven ; Estévez, Pablo A.

  • Author_Institution
    Univ. de Chile, Santiago
  • fYear
    2007
  • fDate
    25-28 Sept. 2007
  • Firstpage
    31
  • Lastpage
    38
  • Abstract
    A clustering method based on multitree genetic programming and an information theoretic fitness is proposed. A probabilistic interpretation is given to the output of trees that does not require a conflict resolution phase. The method can cluster data with irregular shapes, estimate the underlying models of the data for each class and use those models to classify unseen patterns. The proposed scheme is tested on several real and artificial data sets, outperforming k-means algorithm in all of them.
  • Keywords
    data handling; genetic algorithms; information theory; pattern clustering; probability; trees (mathematics); data clustering; information theoretic fitness measure; multitree genetic programming; probabilistic interpretation; Backpropagation; Clustering algorithms; Clustering methods; Encoding; Entropy; Genetic algorithms; Genetic programming; Particle swarm optimization; Partitioning algorithms; Shape measurement;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Evolutionary Computation, 2007. CEC 2007. IEEE Congress on
  • Conference_Location
    Singapore
  • Print_ISBN
    978-1-4244-1339-3
  • Electronic_ISBN
    978-1-4244-1340-9
  • Type

    conf

  • DOI
    10.1109/CEC.2007.4424451
  • Filename
    4424451