Title :
On Producing Balanced Fuzzy Decision Tree Classifiers
Author :
Crockett, Keeley ; Bandar, Zuhair ; O´Shea, James
Author_Institution :
The Manchester Metropolitan Univ., Manchester
Abstract :
This paper investigates a new approach to creating robust fuzzy classifiers that are impartial to the imbalance problem within data sets. The approach uses raw real-world data without the need for sampling or the creation of synthetic examples. The aim is to achieve common currency between the actual classification accuracy and the distribution of this accuracy between the outcome classes. The proposed method first uses a fuzzy inference algorithm (FIA) to construct a fuzzy classifier from a crisp C4.5. A genetic algorithm (GA) is then used to optimize the degree of fuzziness within the classifier. The GA´s fitness function consists of two components: classification accuracy and the distribution (or balance) of this accuracy between the outcomes. Both components are optimised concurrently. Four alternative fitness functions are defined, each of which applies different penalties on the classification accuracy depending on a weighting associated with the balance component. The method is then applied to three real world data sets. The results show that it is possible to attain a fuzzy classifier which exhibits both good performance and balance between outcomes regardless of any imbalance within the data set.
Keywords :
decision trees; fuzzy reasoning; fuzzy set theory; genetic algorithms; pattern classification; sampling methods; balanced fuzzy decision tree classifier; crisp C4.5; data sets; fuzzy inference algorithm; genetic algorithm; robust fuzzy classifier; sampling method; Classification tree analysis; Costs; Decision trees; Fuzzy sets; Fuzzy systems; Genetic algorithms; Induction generators; Inference algorithms; Robustness; Sampling methods;
Conference_Titel :
Fuzzy Systems, 2006 IEEE International Conference on
Conference_Location :
Vancouver, BC
Print_ISBN :
0-7803-9488-7
DOI :
10.1109/FUZZY.2006.1681943