Title :
Effects of Irrelevant Attributes in Fuzzy Clustering
Author :
Döring, Christian ; Borgelt, Christian ; Kruse, Rudolf
Author_Institution :
Dept. of Knowledge Process. & Language Eng., Otto-von-Guericke-Univ. of Magdeburg
Abstract :
In fuzzy clustering soft cluster partitions are formed based on the similarity of data points to the respective cluster prototypes. Similarity is defined in terms of simultaneous closeness regarding all attributes. In some applications the values of many attributes have been measured, but a natural clustering, if it exists, occurs within a (small) subset of attributes. The remaining dimensions can be considered irrelevant. They can obscure an existing grouping and make it harder to discover the cluster structure. In probabilistic fuzzy clustering irrelevant attributes can lead to coincidental cluster centers in the worst case. We study this effect in detail as well as the robustness of different similarity functions and their possible parameterizations against irrelevant input dimensions. Empirical evidence is given for the different properties of the membership functions
Keywords :
data handling; fuzzy set theory; learning (artificial intelligence); pattern clustering; probability; cluster partitions; cluster prototypes; coincidental cluster centers; data point similarity; irrelevant attributes; membership functions; probabilistic fuzzy clustering; similarity function; simultaneous closeness; Clustering algorithms; Data engineering; Design engineering; Fuzzy sets; Iterative algorithms; Knowledge engineering; Partitioning algorithms; Prototypes; Robustness; Shape measurement;
Conference_Titel :
Fuzzy Systems, 2005. FUZZ '05. The 14th IEEE International Conference on
Conference_Location :
Reno, NV
Print_ISBN :
0-7803-9159-4
DOI :
10.1109/FUZZY.2005.1452507