• DocumentCode
    140755
  • Title

    Discriminative features for identifying and interpreting outliers

  • Author

    Xuan Hong Dang ; Assent, Ira ; Ng, Raymond T. ; Zimek, Arthur ; Schubert, Eugen

  • Author_Institution
    Dept. of Comput. Sci., Aarhus Univ., Aarhus, Denmark
  • fYear
    2014
  • fDate
    March 31 2014-April 4 2014
  • Firstpage
    88
  • Lastpage
    99
  • Abstract
    We consider the problem of outlier detection and interpretation. While most existing studies focus on the first problem, we simultaneously address the equally important challenge of outlier interpretation. We propose an algorithm that uncovers outliers in subspaces of reduced dimensionality in which they are well discriminated from regular objects while at the same time retaining the natural local structure of the original data to ensure the quality of outlier explanation. Our algorithm takes a mathematically appealing approach from the spectral graph embedding theory and we show that it achieves the globally optimal solution for the objective of subspace learning. By using a number of real-world datasets, we demonstrate its appealing performance not only w.r.t. the outlier detection rate but also w.r.t. the discriminative human-interpretable features. This is the first approach to exploit discriminative features for both outlier detection and interpretation, leading to better understanding of how and why the hidden outliers are exceptional.
  • Keywords
    data analysis; data mining; statistical distributions; discriminative human-interpretable features; outlier detection; outlier explanation quality; outlier interpretation; spectral graph embedding theory; Data mining; Eigenvalues and eigenfunctions; Face; Feature extraction; Linear programming; Sparse matrices; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering (ICDE), 2014 IEEE 30th International Conference on
  • Conference_Location
    Chicago, IL
  • Type

    conf

  • DOI
    10.1109/ICDE.2014.6816642
  • Filename
    6816642