• DocumentCode
    3530369
  • Title

    An improved genetic algorithm for robust fuzzy clustering with unknown number of clusters

  • Author

    Banerjee, Amit

  • Author_Institution
    Sch. of Sci., Eng. & Technol., Pennsylvania State Univ. at Harrisburg, Middletown, PA, USA
  • fYear
    2010
  • fDate
    12-14 July 2010
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    In this paper the problem of partitioning noisy data when the number of clusters c is not known a priori is revisited. The methodology proposed is a population-based search in the partition space using a genetic algorithm. Potential solutions are represented using a two-part representation scheme, where the first part of the chromosome represents the classification of the data into true (retained) and outlier (trimmed) sets, and the second part is the result of a partition on the true set for a particular value of c, which is simultaneously optimized by the process. A two-tier fitness function is also proposed in this paper, one which first assesses potential solutions on the basis of a test of clustering tendency on the retained set, and later on the efficacy of the partition for a given value of c. A mating pool is created out of highly successful individuals from the test of clustering tendency and allowed to crossover and produce offspring solutions which inherit the better partition from either of its parents. The proposed methodology is an improvement over a multi-objective genetic algorithm-based clustering technique, which previously was shown to be superior (or at least comparable) to robust clustering methods that assume a known value of c.
  • Keywords
    fuzzy set theory; genetic algorithms; pattern clustering; genetic algorithm; mating pool; multiobjective genetic algorithm based clustering technique; noisy data partitioning; population based search; robust fuzzy clustering; two-tier fitness function; CMOS technology; Delay; Delta-sigma modulation; Frequency conversion; Genetic algorithms; Robustness; Time domain analysis; Voltage; Voltage-controlled oscillators; Wideband; FCM; cluster validity; clustering tendency; genetic algorithms; robust clustering;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Information Processing Society (NAFIPS), 2010 Annual Meeting of the North American
  • Conference_Location
    Toronto, ON
  • Print_ISBN
    978-1-4244-7859-0
  • Electronic_ISBN
    978-1-4244-7857-6
  • Type

    conf

  • DOI
    10.1109/NAFIPS.2010.5548175
  • Filename
    5548175