DocumentCode
3316204
Title
Linear Fuzzy Clustering of Mixed Databases Based on Cluster-wise Optimal Scaling of Categorical Variables
Author
Honda, Katsuhiro ; Uesugi, Ryo ; Ichihashi, Hidetomo ; Notsu, Akira
Author_Institution
Osaka Prefecture Univ., Osaka
fYear
2007
fDate
23-26 July 2007
Firstpage
1
Lastpage
6
Abstract
This paper proposes a new approach to linear fuzzy clustering of mixed databases, in which categorical variables are quantified in each cluster based on optimal scaling. The objective function of the Fuzzy c-Varieties (FCV) clustering is defined using least squares criterion, and local principal component analysis (local PCA) is then performed in each cluster considering quantified scores of categorical variables. The new approach quantifies categorical variables in each cluster so that they suit the local linear model of the cluster. So, this is the second approach to optimal scaling in linear fuzzy clustering and contrasts to the global approach where categorical variables are quantified so that they suit for constructing a single numerical data space. The clustering algorithm is an enhanced FCV algorithm that includes an additional step for quantifying categorical variables in each cluster, and is useful for revealing cluster-wise mutual dependencies among numerical and nominal variables rather than for revealing geometrical relationships among data samples.
Keywords
database management systems; fuzzy set theory; least squares approximations; pattern clustering; principal component analysis; categorical variable; fuzzy c-variety clustering; least square criterion; linear fuzzy cluster-wise optimal scaling; mixed database; objective function; principal component analysis; Clustering algorithms; Data analysis; Data mining; Databases; Iterative algorithms; Iterative methods; Least squares methods; Partitioning algorithms; Principal component analysis; Prototypes;
fLanguage
English
Publisher
ieee
Conference_Titel
Fuzzy Systems Conference, 2007. FUZZ-IEEE 2007. IEEE International
Conference_Location
London
ISSN
1098-7584
Print_ISBN
1-4244-1209-9
Electronic_ISBN
1098-7584
Type
conf
DOI
10.1109/FUZZY.2007.4295398
Filename
4295398
Link To Document