Title of article :
Testing for Multivariate Outliers in the Presence of Missing Data
Author/Authors :
W. A. Woodward، نويسنده , , S. R. Sain، نويسنده , , H. L. Gray، نويسنده , , B. Zhao ، نويسنده ,
Issue Information :
ماهنامه با شماره پیاپی سال 2002
Abstract :
We consider the problem of multivariate outlier testing for purposes of distinguishing
seismic signals of underground nuclear events from training samples based on non-nuclear seismic events
when certain data are missing. We consider the case in which the training data follow a multivariate normal
distribution. Assume a potential outlier is observed on which k features of interest are measured. Assume
further that the available training set of n observations on these k features is available but that some of the
observations in the training data have missing features. The approach currently used in practice is to
perform the outlier testing using a generalized likelihood ratio test procedure based only on the data
vectors in the training data with complete data. When there is a substantial amount of missing data within
the training set, use of this strategy may lead to a loss of valuable information. An alternative procedure is
to incorporate all n of the data vectors in the training data using the EM algorithm to appropriately handle
the missing data in the training set. Resampling methods are used to find appropriate critical regions. We
use simulation results and analysis of models fit to Pg/Lg ratios for the WMQ station in China to compare
these two strategies for dealing with missing data.
Keywords :
Outlier testing , nuclear monitoring , Multivariate normal , EM algorithm , Missing data
Journal title :
Pure and Applied Geophysics
Journal title :
Pure and Applied Geophysics