DocumentCode :
2023912
Title :
Fuzzy-rough k-nearest neighbor algorithm for imbalanced data sets learning
Author :
Han, Hui ; Mao, Binghuan
Author_Institution :
Sch. of Inf. Sci. & Technol., Beijing Forestry Univ., Beijing, China
Volume :
3
fYear :
2010
fDate :
10-12 Aug. 2010
Firstpage :
1286
Lastpage :
1290
Abstract :
Learning from imbalanced data sets presents a new challenge to machine learning community, as traditional methods are biased to majority classes and produce poor detection rate of minority classes. This paper presents a new approach, namely fuzzy-rough k-nearest neighbor algorithm for imbalanced data sets learning to improve the classification performance of minority class. The approach defines fuzzy membership function that is in favor of minority class and constructs fuzzy equivalent relation between the unlabeled instance and its k nearest neighbors. The approach takes the fuzziness and roughness of the nearest neighbors of an instance into consideration, and can reduce the disturbance of majority class to minority class. Experiments show that our new approach improves not only the classification performance of minority class more effectively, but also the classification performance of the whole data set comparing with other methods.
Keywords :
fuzzy set theory; learning (artificial intelligence); pattern classification; rough set theory; fuzzy equivalent relation; fuzzy membership function; fuzzy rough k-nearest neighbor algorithm; imbalanced data sets learning; machine learning; Approximation methods; Classification algorithms; Fuzzy set theory; Machine learning; Nearest neighbor searches; Rough sets; fuzzy set theory; fuzzy-rough set theory; imbalanced data set; rough set theory;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Fuzzy Systems and Knowledge Discovery (FSKD), 2010 Seventh International Conference on
Conference_Location :
Yantai, Shandong
Print_ISBN :
978-1-4244-5931-5
Type :
conf
DOI :
10.1109/FSKD.2010.5569116
Filename :
5569116
Link To Document :
بازگشت