DocumentCode :
3764750
Title :
A genetic algorithm based ensemble approach for categorical data clustering
Author :
Jyoti Prokash Goswami;Anjana Kakoti Mahanta
Author_Institution :
Dept. of Computer Applications, Assam Engineering College, Guwahati, India
fYear :
2015
Firstpage :
1
Lastpage :
6
Abstract :
In this paper, we propose a genetic algorithm based procedure to combine different clustering solutions obtained for the same data set to construct a relatively good solution. Clustering ensemble technique, alternatively also known as clustering aggregation or consensus clustering, considers the different individual solutions obtained and combines them into a single solution of better quality using a consensus function. In the genetic algorithm based consensus function proposed here, each cluster is represented using a single representative. A chromosome represents a set of cluster representatives for a particular clustering result. Single point crossover is made between the two cluster representatives of highest similarity value so that changes in the clusters of the selected pair of chromosomes are small i.e. the clustering solutions gradually converge to the optimal one. Mutation is performed in such a way that the properties of cluster representatives remain invariant. A new fitness measure is proposed to evaluate the fitness of the cluster representatives as well as of the individual chromosomes. Experiments are made on real life state-of the -art data sets and results are reported.
Keywords :
"Biological cells","Clustering algorithms","Genetic algorithms","Partitioning algorithms","Sociology","Statistics","Indexes"
Publisher :
ieee
Conference_Titel :
India Conference (INDICON), 2015 Annual IEEE
Electronic_ISBN :
2325-9418
Type :
conf
DOI :
10.1109/INDICON.2015.7443450
Filename :
7443450
Link To Document :
بازگشت