Title :
Data clustering based on approach of genetic algorithm
Author :
Wang, Hai-Hui ; Zhao, Wen-jie
Author_Institution :
Sch. of Comput. Sci. & Eng., Wuhan Inst. of Technol., Wuhan
Abstract :
Data clustering has been an active research area in the data mining community, and genetic algorithms have been used in a wide variety of fields to perform clustering. An efficient genetic algorithm for clustering on very large data sets is proposed in this paper. This algorithm can not only deal with higher local constringency speed and stronger global fast search, but also get down to the obstacles constraints and practicalities of large data clustering. The results on real datasets show that the algorithm performs better than the other algorithm. We also test this algorithm on artificial data sets, which are also large size. The experimental results show that our algorithm outperforms the algorithm in terms of running time as well as the quality of the clustering.
Keywords :
data mining; genetic algorithms; data clustering; data mining; genetic algorithm; Clustering algorithms; Clustering methods; Computer science; Data engineering; Data mining; Electronic mail; Genetic algorithms; Genetic engineering; Geography; Space technology; Data Clustering; Data Mining; Genetic Algorithm;
Conference_Titel :
Control and Decision Conference, 2008. CCDC 2008. Chinese
Conference_Location :
Yantai, Shandong
Print_ISBN :
978-1-4244-1733-9
Electronic_ISBN :
978-1-4244-1734-6
DOI :
10.1109/CCDC.2008.4597827