Title :
A fuzzy approach to partitioning continuous attributes for classification
Author :
Au, Wai-Ho ; Chan, Keith C C ; Wong, Andrew K.C.
Author_Institution :
Microsoft Corp., Redmond, WA, USA
fDate :
5/1/2006 12:00:00 AM
Abstract :
Classification is an important topic in data mining research. To better handle continuous data, fuzzy sets are used to represent interval events in the domains of continuous attributes, allowing continuous data lying on the interval boundaries to partially belong to multiple intervals. Since the membership functions of fuzzy sets can profoundly affect the performance of the models or rules discovered, the determination of membership functions or fuzzy partitioning is crucial. In this paper, we present a new method to determine the membership functions of fuzzy sets directly from data to maximize the class-attribute interdependence and, hence, improve the classification results. In other words, it forms a fuzzy partition of the input space automatically, using an information-theoretic measure to evaluate the interdependence between the class membership and an attribute as the objective function for fuzzy partitioning. To find the optimum of the measure, it employs fractional programming. To evaluate the effectiveness of the proposed method, several real-world data sets are used in our experiments. The experimental results show that this method outperforms other well-known discretization and fuzzy partitioning approaches.
Keywords :
data mining; fuzzy set theory; nonlinear programming; pattern classification; data classification; data mining; discretization approach; fractional programming; fuzzy partitioning; fuzzy set; information-theoretic measure; membership function; rule discovery; Biological cells; Clustering algorithms; Data mining; Dynamic programming; Fuzzy neural networks; Fuzzy sets; Genetic algorithms; Information theory; Neural networks; Partitioning algorithms; Fuzzy partition; classification; data mining.; discretization; fuzzy sets; membership functions;
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
DOI :
10.1109/TKDE.2006.70