DocumentCode :
697795
Title :
Adaptive cluster-based outlier detection
Author :
Strutz, Tilo
Author_Institution :
Deutsche Telekom AG, Hochschule fur Telekommunikation, Leipzig, Germany
fYear :
2009
fDate :
24-28 Aug. 2009
Firstpage :
1710
Lastpage :
1714
Abstract :
The analysis of data is typically accompanied by concern as to the correctness of recorded data points; some of the points might be contaminated, thereby distorting the result of the analysis. This paper proposes a novel cluster-based and distribution-independent method for outlier detection. Based on Monte Carlo simulations, the new method is tested with different data distributions and compared with the method of standardised residuals (also known as the z-score). It is shown that the cluster-based approach identifies outliers more reliably, even for a normal data distribution, and the advantages are discussed in detail.
Keywords :
Monte Carlo methods; data analysis; normal distribution; Monte Carlo simulation; adaptive cluster-based outlier detection; data analysis; distribution-independent method; normal data distribution; recorded data points; standardised residuals; Data analysis; Data models; Distributed databases; Estimation; Gaussian distribution; Standards; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing Conference, 2009 17th European
Conference_Location :
Glasgow
Print_ISBN :
978-161-7388-76-7
Type :
conf
Filename :
7077367
Link To Document :
بازگشت