Title :
Genetic programming-based clustering using an information theoretic fitness measure
Author :
Boric, Neven ; Estévez, Pablo A.
Author_Institution :
Univ. de Chile, Santiago
Abstract :
A clustering method based on multitree genetic programming and an information theoretic fitness is proposed. A probabilistic interpretation is given to the output of trees that does not require a conflict resolution phase. The method can cluster data with irregular shapes, estimate the underlying models of the data for each class and use those models to classify unseen patterns. The proposed scheme is tested on several real and artificial data sets, outperforming k-means algorithm in all of them.
Keywords :
data handling; genetic algorithms; information theory; pattern clustering; probability; trees (mathematics); data clustering; information theoretic fitness measure; multitree genetic programming; probabilistic interpretation; Backpropagation; Clustering algorithms; Clustering methods; Encoding; Entropy; Genetic algorithms; Genetic programming; Particle swarm optimization; Partitioning algorithms; Shape measurement;
Conference_Titel :
Evolutionary Computation, 2007. CEC 2007. IEEE Congress on
Conference_Location :
Singapore
Print_ISBN :
978-1-4244-1339-3
Electronic_ISBN :
978-1-4244-1340-9
DOI :
10.1109/CEC.2007.4424451