مرکز منطقه ای اطلاع رساني علوم و فناوري - MSGKA: an efficient clustering algorithm for large databases

DocumentCode :

389624

Title :

MSGKA: an efficient clustering algorithm for large databases

Author :

Tsai, Cheng-Fa ; Chen, Zhi-Cheng ; Tsai, Chun-Wei

Author_Institution :

Dept. of Manage. Inf. Syst., Nat. Pingtung Univ. of Sci. & Technol., Taiwan

Volume :

fYear :

2002

fDate :

6-9 Oct. 2002

Abstract :

This investigation presents an efficient clustering algorithm for large databases. We present a novel multiple-searching genetic algorithm (MSGA) that finds a globally optimal partition of a given data into a specified number of clusters. We hybridize MSGA with a multiple-searching approach utilized in clustering namely, K-means algorithm. Hence, the name multiple-searching genetic K-means algorithm (MSGKA). Our simulation results reveal that the proposed novel clustering approach performs better than the Fast SOM combines K-means approach (FSOM+K-means) and Genetic K-Means Algorithm (GKA). Moreover, in all the cases we studied, our approach produces much smaller errors than both the FSOM+K-means and GKA.

Keywords :

data mining; database theory; genetic algorithms; pattern clustering; search problems; very large databases; Fast SOM combined K-means approach; Genetic K-Means Algorithm; K-means algorithm; MSGKA; clustering algorithm; data mining; errors; large databases; multiple-searching genetic K-means algorithm; multiple-searching genetic algorithm; simulation; Biological cells; Clustering algorithms; Costs; Data mining; Databases; Genetic algorithms; Machine learning; Partitioning algorithms; Pattern recognition; Statistics;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Systems, Man and Cybernetics, 2002 IEEE International Conference on

ISSN :

1062-922X

Print_ISBN :

0-7803-7437-1

Type :

conf

DOI :

10.1109/ICSMC.2002.1176400

Filename :

1176400

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=389624