Title of article :
Testing for Multivariate Outliers in the Presence of Missing Data
Author/Authors :
W. A. Woodward، نويسنده , , S. R. Sain، نويسنده , , H. L. Gray، نويسنده , , B. Zhao ، نويسنده ,
Issue Information :
ماهنامه با شماره پیاپی سال 2002
Pages :
15
From page :
889
To page :
903
Abstract :
We consider the problem of multivariate outlier testing for purposes of distinguishing seismic signals of underground nuclear events from training samples based on non-nuclear seismic events when certain data are missing. We consider the case in which the training data follow a multivariate normal distribution. Assume a potential outlier is observed on which k features of interest are measured. Assume further that the available training set of n observations on these k features is available but that some of the observations in the training data have missing features. The approach currently used in practice is to perform the outlier testing using a generalized likelihood ratio test procedure based only on the data vectors in the training data with complete data. When there is a substantial amount of missing data within the training set, use of this strategy may lead to a loss of valuable information. An alternative procedure is to incorporate all n of the data vectors in the training data using the EM algorithm to appropriately handle the missing data in the training set. Resampling methods are used to find appropriate critical regions. We use simulation results and analysis of models fit to Pg/Lg ratios for the WMQ station in China to compare these two strategies for dealing with missing data.
Keywords :
Outlier testing , nuclear monitoring , Multivariate normal , EM algorithm , Missing data
Journal title :
Pure and Applied Geophysics
Serial Year :
2002
Journal title :
Pure and Applied Geophysics
Record number :
429441
Link To Document :
بازگشت