• DocumentCode
    697795
  • Title

    Adaptive cluster-based outlier detection

  • Author

    Strutz, Tilo

  • Author_Institution
    Deutsche Telekom AG, Hochschule fur Telekommunikation, Leipzig, Germany
  • fYear
    2009
  • fDate
    24-28 Aug. 2009
  • Firstpage
    1710
  • Lastpage
    1714
  • Abstract
    The analysis of data is typically accompanied by concern as to the correctness of recorded data points; some of the points might be contaminated, thereby distorting the result of the analysis. This paper proposes a novel cluster-based and distribution-independent method for outlier detection. Based on Monte Carlo simulations, the new method is tested with different data distributions and compared with the method of standardised residuals (also known as the z-score). It is shown that the cluster-based approach identifies outliers more reliably, even for a normal data distribution, and the advantages are discussed in detail.
  • Keywords
    Monte Carlo methods; data analysis; normal distribution; Monte Carlo simulation; adaptive cluster-based outlier detection; data analysis; distribution-independent method; normal data distribution; recorded data points; standardised residuals; Data analysis; Data models; Distributed databases; Estimation; Gaussian distribution; Standards; Training;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference, 2009 17th European
  • Conference_Location
    Glasgow
  • Print_ISBN
    978-161-7388-76-7
  • Type

    conf

  • Filename
    7077367