DocumentCode
2333226
Title
Scalable evolutionary clustering algorithm with Self Adaptive Genetic Operators
Author
León, Elizabeth ; Nasraoui, Olfa ; Gomez, Jonatan
Author_Institution
Dept. of Comput. Eng., Univ. Nac. de Colombia, Bogota, Colombia
fYear
2010
fDate
18-23 July 2010
Firstpage
1
Lastpage
8
Abstract
In this paper, we present a scalable evolutionary algorithm for clustering large and dynamic data sets, called Scalable Evolutionary Clustering with Self Adaptive Genetic Operators (Scalable ECSAGO). The proposed evolutionary clustering algorithm can adapt its genetic operators rate while the evolution leads to the optimal centers of the clusters. The sizes of the clusters are estimated using a hybrid analytical optimization procedure. Moreover, a memorization factor is introduced in order to allow the algorithm to keep as much of the previously discovered knowledge about clusters and data summarization as desired. The proposed scalable ECSAGO algorithm is able to find accurate representations of the clusters on very large data sets of different sizes and dimensionality that might not fit in main memory, while maintaining the desirable properties of robustness to noise and automatic detection of the number of clusters. The algorithm is also useful for traking evolving cluster structures that change with the passage of time.
Keywords
data analysis; data mining; data structures; evolutionary computation; pattern clustering; data structures; data summarization; knowledge discovery; memorization factor; scalable evolutionary clustering algorithm; self adaptive genetic operator; Clustering algorithms; Data mining; Estimation; Genetics; Heuristic algorithms; Kernel; Prototypes;
fLanguage
English
Publisher
ieee
Conference_Titel
Evolutionary Computation (CEC), 2010 IEEE Congress on
Conference_Location
Barcelona
Print_ISBN
978-1-4244-6909-3
Type
conf
DOI
10.1109/CEC.2010.5586467
Filename
5586467
Link To Document