Title :
A fast approach to building rough data model through G-K fuzzy clustering
Author :
Huang, Jin-jie ; Li, Shi-yong ; Ban, Xiao-jun
Author_Institution :
Dept. of Control Sci. & Eng., Harbin Inst. of Technol., China
Abstract :
A new method for quickly building the rough data model (RDM) by means of fuzzy clustering is proposed. The scheme is based on the Gustafson-Kessel (GK) algorithm, which has many good properties and has proven itself in the data-mining context. In this paper, we first investigate how to integrate the RDM's classification quality performance index into the GK clustering algorithm in the product space of the input and output variables. We then suggest a way to convert the fuzzy cluster models into rough data models. The result is an efficient algorithm that obtains RDMs by iteratively computing just two necessary-condition equations that minimize the objective function, and that turns the multi-dimensional search process of Kowalczyk's method into a one-dimensional search strategy (over the number of clusters). This technique greatly reduces the search time. Moreover, by introducing the concept of the fuzzy degree of fulfillment (DoF) of a cluster rule, our approach appears to be more flexible and better able to handle data contaminated by noise, with better generalization ability than traditional rough set theory and Kowalczyk's method. Finally, two examples illustrate the effectiveness of our approach.
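As a rough illustration of the iteration the abstract refers to, the sketch below implements the standard Gustafson-Kessel algorithm only: it alternates the two update equations (cluster prototypes, then fuzzy memberships) with a covariance-adapted distance norm. It does not include the paper's classification quality index or the conversion of clusters to RDMs, which the abstract does not specify. The function name `gk_cluster`, its parameters, the fixed cluster volume of 1, and the small regularization term are assumptions made for this example.

```python
import numpy as np

def gk_cluster(X, c, m=2.0, tol=1e-5, max_iter=200, seed=0):
    """Standard Gustafson-Kessel fuzzy clustering (illustrative sketch).

    X: (N, n) data in the product space of inputs and outputs.
    c: number of clusters; m: fuzzifier (> 1).
    Returns the fuzzy partition matrix U (c, N) and the centers V (c, n).
    """
    rng = np.random.default_rng(seed)
    N, n = X.shape
    # Random initial partition; each column (one sample) sums to 1.
    U = rng.random((c, N))
    U /= U.sum(axis=0, keepdims=True)
    for _ in range(max_iter):
        W = U ** m
        # Update equation 1: cluster prototypes (weighted means).
        V = (W @ X) / W.sum(axis=1, keepdims=True)
        D2 = np.empty((c, N))
        for i in range(c):
            diff = X - V[i]
            # Fuzzy covariance matrix of cluster i.
            F = (W[i, :, None] * diff).T @ diff / W[i].sum()
            F += 1e-9 * np.eye(n)  # assumption: tiny ridge for stability
            # Norm-inducing matrix with cluster volume fixed at 1.
            A = np.linalg.det(F) ** (1.0 / n) * np.linalg.inv(F)
            # Squared distance of every sample to prototype i.
            D2[i] = np.einsum("kj,jl,kl->k", diff, A, diff)
        D2 = np.maximum(D2, 1e-12)  # avoid division by zero
        # Update equation 2: fuzzy memberships.
        inv = D2 ** (-1.0 / (m - 1.0))
        U_new = inv / inv.sum(axis=0, keepdims=True)
        converged = np.abs(U_new - U).max() < tol
        U = U_new
        if converged:
            break
    return U, V
```

With this sketch, the one-dimensional search over the number of clusters mentioned in the abstract would amount to calling `gk_cluster` for c = 2, 3, ... and scoring each result with a quality index of one's choice.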
Keywords :
data mining; data models; fuzzy set theory; iterative methods; pattern clustering; rough set theory; Gustafson-Kessel algorithm; Kowalczyks method; classification quality performance; data handling; data-mining; fuzzy cluster models; fuzzy clustering; fuzzy degree of fulfillment; rough data model; search process; Body sensor networks; Clustering algorithms; Data engineering; Data models; Fuzzy control; Iterative algorithms; Machine learning; Performance analysis; Pollution measurement; Set theory;
Conference_Title :
Machine Learning and Cybernetics, 2003 International Conference on
Print_ISBN :
0-7803-8131-9
DOI :
10.1109/ICMLC.2003.1259743