Title :
A genetic learning of the fuzzy rule-based classification system granularity for highly imbalanced data-sets
Author :
Villar, Pedro ; Fernández, Alberto ; Herrera, Francisco
Author_Institution :
Dept. of Software Eng., Univ. of Granada, Granada, Spain
Abstract :
In this contribution we analyse the significance of the granularity level (number of labels) in Fuzzy Rule-Based Classification Systems in the scenario of data-sets with a high imbalance degree. We refer to imbalanced data-sets when the class distribution is not uniform, a situation that it is present in many real application areas. The aim of this work is to adapt the number of fuzzy labels for each problem, applying a fine granularity in those variables which have a higher dispersion of values and a thick granularity in the variables where an excessive number of labels may result irrelevant. We compare this methodology with the use of a fixed number of labels and with the C4.5 decision tree.
Keywords :
decision trees; fuzzy set theory; genetic algorithms; pattern classification; class distribution; decision tree; fine granularity; fuzzy labels; fuzzy rule-based classification system granularity; fuzzy rule-based classification systems; genetic learning; granularity level; highly imbalanced datasets; imbalance degree; Biomedical equipment; Decision trees; Face recognition; Fuzzy systems; Genetics; Learning systems; Machine learning; Medical services; Risk management; Stress;
Conference_Titel :
Fuzzy Systems, 2009. FUZZ-IEEE 2009. IEEE International Conference on
Conference_Location :
Jeju Island
Print_ISBN :
978-1-4244-3596-8
Electronic_ISBN :
1098-7584
DOI :
10.1109/FUZZY.2009.5277304