• DocumentCode
    2331802
  • Title

    Outlier Detection in Benchmark Classification Tasks

  • Author

    Li, Hongyu ; Niranjan, Mahesan

  • Author_Institution
    Dept. of Comput. Sci., Sheffield Univ.
  • Volume
    5
  • fYear
    2006
  • fDate
    14-19 May 2006
  • Abstract
    We present a new outlier detection method which is appropriate for classification problems. It combines estimating the overall probability density and sequential ranking of the data according to observed changes in performance on validation sets. The method has been implemented on ten widely used benchmark datasets and a spam email filtering application. Evaluated by six popular machine learning methods, classification performances are shown to improve after removing outliers in comparison to removing the same number of examples at random from the datasets
  • Keywords
    information filtering; learning (artificial intelligence); probability; benchmark classification tasks; machine learning methods; outlier detection; overall probability density; sequential ranking; spam email filtering application; Additive noise; Biological system modeling; Computer science; Electronic mail; Instruments; Labeling; Noise robustness; Performance evaluation; Predictive models; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
  • Conference_Location
    Toulouse
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0469-X
  • Type

    conf

  • DOI
    10.1109/ICASSP.2006.1661336
  • Filename
    1661336