Title :
A fuzzy classifier for imbalanced and noisy data
Author :
Visa, Sofia ; Ralescu, Anca
Author_Institution :
Dept. of Electr. Comput. & Eng. Comput. Sci., Cincinnati Univ., OH, USA
Abstract :
This paper deals with the learning concept in the presence of noise (overlap) and imbalance in the training set. The starting assumption is that recognition of the smaller class is much more important than that of the larger class. A fuzzy classifier capable of achieving this based on the relation between fuzzy sets and probability distributions as mediated by the theory of mass assignment. Two approaches to construct fuzzy sets - basic and modified - using the lpd and mpd selection rules are investigated. Preliminary results suggest that the use of mpd selection rule in conjunction with the modified approach is better for recalling the small class at a small cost to the recognition of the negative class.
Keywords :
fuzzy set theory; learning (artificial intelligence); probability; basic fuzzy set; fuzzy classifier; imbalanced data; learning concept; mass assignment; modified fuzzy set; noisy data; probability distribution; selection rules; Algorithm design and analysis; Classification algorithms; Costs; Data preprocessing; Frequency conversion; Fuzzy set theory; Fuzzy sets; Level set; Probability distribution; Terminology;
Conference_Titel :
Fuzzy Systems, 2004. Proceedings. 2004 IEEE International Conference on
Print_ISBN :
0-7803-8353-2
DOI :
10.1109/FUZZY.2004.1375444