• DocumentCode
    710077
  • Title

    Detection and visualisation of outliers using kernel principal components

  • Author

    Nasser, Alissar ; Hamad, Denis

  • Author_Institution
    Fac. of Econ. & Bus. Adm., Lebanese Univ., Hadath, Lebanon
  • fYear
    2015
  • fDate
    April 29 2015-May 1 2015
  • Firstpage
    119
  • Lastpage
    124
  • Abstract
    We apply, in this article, a new method to identify outliers from a dataset. It consists to use the K-means clustering algorithm on the smallest principal components provided by the kernel principal components analysis. Two leading methods commonly used in the domain namely the standard deviation and the Tukey boxplot are tested and compared to our method. The experiments on artificial and real datasets show that our approach better detects outliers than the two classical methods.
  • Keywords
    pattern clustering; principal component analysis; Tukey boxplot; k-means clustering algorithm; kernel principal components analysis; outlier detection; outlier visualisation; standard deviation; Algorithm design and analysis; Clustering algorithms; Decision support systems; Eigenvalues and eigenfunctions; Kernel; Principal component analysis; Standards; K-means; Outlier detection; Tukey boxplot; kernel principal component analysis; non-linear projection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Digital Information and Communication Technology and its Applications (DICTAP), 2015 Fifth International Conference on
  • Conference_Location
    Beirut
  • Print_ISBN
    978-1-4799-4130-8
  • Type

    conf

  • DOI
    10.1109/DICTAP.2015.7113183
  • Filename
    7113183