Title :
Combining Fuzzy Clustering with ANN Classifier for Categorical Data
Author :
Saha, Indrajit ; Mukhopadhyay, Anirban ; Maulik, Ujjwal
Abstract :
This article deals with the development of an improved clustering technique for categorical data that is based on the identification of points having significant membership to multiple classes. Cluster assignments of such points are difficult, and they often affect the actual partitioning of the data. As a consequence, it may be more effective if the points that are associated with maximum confusion regarding their cluster assignments are first identified and excluded from consideration at the first stage of algorithm and these points may be assigned to one of the identified clusters based on an ANN classifier at the second stage of this algorithm. At the first stage of this algorithm we are using our developed genetic algorithm and simulated annealing based fuzzy clustering and well known fuzzy C-medoids algorithm when the number of clusters is known a priori. The performance of the proposed clustering algorithms has been compared with the average linkage hierarchical clustering algorithm, in addition to the genetic algorithm based fuzzy clustering, simulated annealing based fuzzy clustering and fuzzy C-medoids with ANN for a variety of artificial and real life categorical data sets. Also statistical significance test have been performed to establish the superiority of the proposed algorithm.
Keywords :
fuzzy set theory; genetic algorithms; neural nets; pattern classification; pattern clustering; simulated annealing; statistical testing; artificial neural network classifier; categorical data set; fuzzy C-medoids algorithm; fuzzy clustering technique; genetic algorithm; simulated annealing; statistical significance test; Artificial neural networks; Clustering algorithms; Couplings; Fuzzy neural networks; Fuzzy sets; Genetic algorithms; Partitioning algorithms; Performance evaluation; Simulated annealing; Testing; Artificial neural network classifier; Fuzzy clustering; Genetic algorithm; Multi-class membership; Simulated annealing; Statistical significance test;
Conference_Titel :
Advance Computing Conference, 2009. IACC 2009. IEEE International
Conference_Location :
Patiala
Print_ISBN :
978-1-4244-2927-1
Electronic_ISBN :
978-1-4244-2928-8
DOI :
10.1109/IADCC.2009.4808978