DocumentCode
2331802
Title
Outlier Detection in Benchmark Classification Tasks
Author
Li, Hongyu ; Niranjan, Mahesan
Author_Institution
Dept. of Comput. Sci., Sheffield Univ.
Volume
5
fYear
2006
fDate
14-19 May 2006
Abstract
We present a new outlier detection method which is appropriate for classification problems. It combines estimating the overall probability density and sequential ranking of the data according to observed changes in performance on validation sets. The method has been implemented on ten widely used benchmark datasets and a spam email filtering application. Evaluated by six popular machine learning methods, classification performances are shown to improve after removing outliers in comparison to removing the same number of examples at random from the datasets
Keywords
information filtering; learning (artificial intelligence); probability; benchmark classification tasks; machine learning methods; outlier detection; overall probability density; sequential ranking; spam email filtering application; Additive noise; Biological system modeling; Computer science; Electronic mail; Instruments; Labeling; Noise robustness; Performance evaluation; Predictive models; Working environment noise;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location
Toulouse
ISSN
1520-6149
Print_ISBN
1-4244-0469-X
Type
conf
DOI
10.1109/ICASSP.2006.1661336
Filename
1661336
Link To Document