DocumentCode
3530369
Title
An improved genetic algorithm for robust fuzzy clustering with unknown number of clusters
Author
Banerjee, Amit
Author_Institution
Sch. of Sci., Eng. & Technol., Pennsylvania State Univ. at Harrisburg, Middletown, PA, USA
fYear
2010
fDate
12-14 July 2010
Firstpage
1
Lastpage
6
Abstract
In this paper the problem of partitioning noisy data when the number of clusters c is not known a priori is revisited. The methodology proposed is a population-based search in the partition space using a genetic algorithm. Potential solutions are represented using a two-part representation scheme, where the first part of the chromosome represents the classification of the data into true (retained) and outlier (trimmed) sets, and the second part is the result of a partition on the true set for a particular value of c, which is simultaneously optimized by the process. A two-tier fitness function is also proposed in this paper, one which first assesses potential solutions on the basis of a test of clustering tendency on the retained set, and later on the efficacy of the partition for a given value of c. A mating pool is created out of highly successful individuals from the test of clustering tendency and allowed to crossover and produce offspring solutions which inherit the better partition from either of its parents. The proposed methodology is an improvement over a multi-objective genetic algorithm-based clustering technique, which previously was shown to be superior (or at least comparable) to robust clustering methods that assume a known value of c.
Keywords
fuzzy set theory; genetic algorithms; pattern clustering; genetic algorithm; mating pool; multiobjective genetic algorithm based clustering technique; noisy data partitioning; population based search; robust fuzzy clustering; two-tier fitness function; CMOS technology; Delay; Delta-sigma modulation; Frequency conversion; Genetic algorithms; Robustness; Time domain analysis; Voltage; Voltage-controlled oscillators; Wideband; FCM; cluster validity; clustering tendency; genetic algorithms; robust clustering;
fLanguage
English
Publisher
ieee
Conference_Titel
Fuzzy Information Processing Society (NAFIPS), 2010 Annual Meeting of the North American
Conference_Location
Toronto, ON
Print_ISBN
978-1-4244-7859-0
Electronic_ISBN
978-1-4244-7857-6
Type
conf
DOI
10.1109/NAFIPS.2010.5548175
Filename
5548175
Link To Document