DocumentCode
2182311
Title
Parallel Computation of Modified Stahel-Donoho Estimators for Multivariate Outlier Detection
Author
Wada, Kazuyoshi ; Tsubaki, Hiroe
Author_Institution
Nat. Stat. Center, Tokyo, Japan
fYear
2013
fDate
16-19 Dec. 2013
Firstpage
304
Lastpage
311
Abstract
Modified Stahel-Donoho (MSD) estimators are an orthogonally equivariant multivariate outlier detection method with a high breakdown point for all dimensions. An R function of the MSD estimators is created and its performance is confirmed, however, the method suffers from the curse of dimensionality and its implementation is limited to relatively low dimensional datasets. This paper proposes a parallel computing approach to cope with higher dimensionality and presents results for a few datasets to illustrate its use. Code for both the utilized parallelized function and the original single-core function have been placed in a public repository for further evaluation.
Keywords
Big Data; estimation theory; parallel algorithms; Big Data; MSD estimators; curse of dimensionality; low dimensional datasets; modified Stahel-Donoho estimators; multivariate outlier detection method; parallel algorithm; parallel computing approach; parallelized function; single-core function; Computational modeling; Correlation; Covariance matrices; Electric breakdown; Robustness; Software; Vectors; Mahalanobis distance; multivariate location and scatter; outlier detection; projection pursuit;
fLanguage
English
Publisher
ieee
Conference_Titel
Cloud Computing and Big Data (CloudCom-Asia), 2013 International Conference on
Conference_Location
Fuzhou
Print_ISBN
978-1-4799-2829-3
Type
conf
DOI
10.1109/CLOUDCOM-ASIA.2013.86
Filename
6821008
Link To Document