Title :
Automatic aspect discrimination in relational data clustering
Author :
Horta, Danilo ; Campello, Ricardo J G B
Author_Institution :
Dept. of Comput. Sci., Univ. of Sao Paulo, Sao Carlos, Brazil
Abstract :
The features describing a data set may often be arranged in meaningful subsets, each of which corresponds to a different aspect of the data. An unsupervised algorithm (SCAD) that performs fuzzy clustering and aspects weighting simultaneously was recently proposed. However, there are several situations where the data set is represented by proximity matrices only (relational data), which renders several clustering approaches, including SCAD, inappropriate. To handle this kind of data, the relational clustering algorithm CARD, based on the SCAD algorithm, has been recently developed. However, CARD may fail and halt given certain conditions. To fix this problem, its steps are modified and then reordered to also reduce the number of parameters required. The improved CARD is assessed over hundreds of real and artificial data sets.
Keywords :
matrix algebra; pattern clustering; relational databases; set theory; unsupervised learning; CARD; SCAD; artificial data sets; automatic aspect discrimination; fuzzy clustering; proximity matrices; real data sets; relational data clustering; unsupervised algorithm; Algorithm design and analysis; Clustering algorithms; Equations; Indexes; Mathematical model; Noise; Vectors; aspect discrimination; feature selection; fuzzy clustering; relational clustering;
Conference_Titel :
Intelligent Systems Design and Applications (ISDA), 2011 11th International Conference on
Conference_Location :
Cordoba
Print_ISBN :
978-1-4577-1676-8
DOI :
10.1109/ISDA.2011.6121709