DocumentCode
785901
Title
Robust fuzzy clustering of relational data
Author
Davé, Rajesh N. ; Sen, Sumit
Author_Institution
Dept. of Mech. Eng., New Jersey Inst. of Technol., Newark, NJ, USA
Volume
10
Issue
6
fYear
2002
fDate
12/1/2002 12:00:00 AM
Firstpage
713
Lastpage
727
Abstract
Popular relational-data clustering algorithms, relational dual of fuzzy c-means (RFCM), non-Euclidean RFCM (NERFCM) (both by Hathaway et al), and FANNY (by Kaufman and Rousseeuw) are examined. A new algorithm, which is a generalization of FANNY, called the fuzzy relational data clustering (FRC) algorithm, is introduced, having an identical objective functional as RFCM. However, the FRC does not have the restriction of RFCM, which is that the relational data is derived from Euclidean distance as the measure of dissimilarity between the objects, and it also does not have limitations of FANNY, including the use of a fixed membership exponent, or a fuzzifier exponent, m. The FRC algorithm is further improved by incorporating the concept of Dave´s object data noise clustering (NC) algorithm, done by proposing a concept of noise-dissimilarity. Next, based on the constrained minimization, which includes an inequality constraint for the memberships and corresponding Kuhn-Tucker conditions, a noise resistant, FRC algorithm is derived which works well for all types of non-Euclidean dissimilarity data. Thus it is shown that the extra computations for data expansion (β-spread transformation) required by the NERFCM algorithm are not necessary. This new algorithm is called robust non-Euclidean fuzzy relational data clustering (robust-NE-FRC), and its robustness is demonstrated through several numerical examples. Advantages of this new algorithm are: faster convergence, robustness against outliers, and ability to handle all kinds of relational data, including non-Euclidean. The paper also presents a new and better interpretation of the noise-class.
Keywords
fuzzy set theory; fuzzy systems; minimisation; pattern clustering; relational databases; β-spread transformation; Dave object data noise clustering algorithm; Euclidean distance; FANNY; Kuhn-Tucker conditions; constrained minimization; convergence; data expansion; dissimilarity measure; fixed membership exponent; fuzzifier exponent; fuzzy relational data clustering algorithm; inequality constraint; noise dissimilarity; nonEuclidean RFCM; numerical examples; outlier robustness; relational data; relational dual of fuzzy c-means; relational-data clustering algorithms; robust fuzzy clustering; robust nonEuclidean fuzzy relational data clustering; Clustering algorithms; Clustering methods; Convergence; Data engineering; Engineering management; Euclidean distance; Industrial engineering; Microcomputers; Minimization methods; Noise robustness;
fLanguage
English
Journal_Title
Fuzzy Systems, IEEE Transactions on
Publisher
ieee
ISSN
1063-6706
Type
jour
DOI
10.1109/TFUZZ.2002.805899
Filename
1097772
Link To Document