DocumentCode
697795
Title
Adaptive cluster-based outlier detection
Author
Strutz, Tilo
Author_Institution
Deutsche Telekom AG, Hochschule fur Telekommunikation, Leipzig, Germany
fYear
2009
fDate
24-28 Aug. 2009
Firstpage
1710
Lastpage
1714
Abstract
The analysis of data is typically accompanied by concern as to the correctness of recorded data points; some of the points might be contaminated, thereby distorting the result of the analysis. This paper proposes a novel cluster-based and distribution-independent method for outlier detection. Based on Monte Carlo simulations, the new method is tested with different data distributions and compared with the method of standardised residuals (also known as the z-score). It is shown that the cluster-based approach identifies outliers more reliably, even for a normal data distribution, and the advantages are discussed in detail.
Keywords
Monte Carlo methods; data analysis; normal distribution; Monte Carlo simulation; adaptive cluster-based outlier detection; data analysis; distribution-independent method; normal data distribution; recorded data points; standardised residuals; Data analysis; Data models; Distributed databases; Estimation; Gaussian distribution; Standards; Training;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing Conference, 2009 17th European
Conference_Location
Glasgow
Print_ISBN
978-161-7388-76-7
Type
conf
Filename
7077367
Link To Document