DocumentCode :
2453062
Title :
A Novel Noise Filtering Algorithm for Imbalanced Data
Author :
Van Hulse, Jason ; Khoshgoftaar, Taghi M. ; Napolitano, Amri
Author_Institution :
Florida Atlantic Univ., Boca Raton, FL, USA
fYear :
2010
fDate :
12-14 Dec. 2010
Firstpage :
9
Lastpage :
14
Abstract :
Noise filtering is a commonly-used methodology to improve the performance of learners built using low-quality data. A common type of noise filtering is a data preprocessing technique called classification filtering. In classification filtering, a classifier is built and evaluated on the training dataset (typically using cross-validation) and any misclassified instances are considered noisy. The strategies employed with classification filters are not ideal, particularly when learning from class-imbalanced data. To address this deficiency, we propose an alternative method for classification filtering called the threshold-adjusted classification filter. This methodology is compared with the standard classification filter, and the results clearly demonstrate the efficacy of our technique.
Keywords :
filtering theory; noise; pattern classification; cross-validation; data preprocessing technique; imbalanced data; noise filtering algorithm; threshold-adjusted classification filter; training dataset; Neodymium; Niobium; Noise; Noise level; Noise measurement; Training; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Applications (ICMLA), 2010 Ninth International Conference on
Conference_Location :
Washington, DC
Print_ISBN :
978-1-4244-9211-4
Type :
conf
DOI :
10.1109/ICMLA.2010.9
Filename :
5708806
Link To Document :
بازگشت