Title of article :
Applications of maximum likelihood principal component analysis: incomplete data sets and calibration transfer
Author/Authors :
Darren T. Andrews، نويسنده , , Peter D. Wentzell، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 1997
Pages :
12
From page :
341
To page :
352
Abstract :
The application of a new method to the multivariate analysis of incomplete data sets is described. The new method, called maximum likelihood principal component analysis (MLPCA), is analogous to conventional principal component analysis (PCA), but incorporates measurement error variance information in the decomposition of multivariate data. Missing measurements can be handled in a reliable and simple manner by assigning large measurement uncertainties to them. The problem of missing data is pervasive in chemistry, and MLPCA is applied to three sets of experimental data to illustrate its utility. For exploratory data analysis, a data set from the analysis of archeological artifacts is used to show that the principal components extracted by MLPCA retain much of the original information even when a significant number of measurements are missing. Maximum likelihood projections of censored data can often preserve original clusters among the samples and can, through the propagation of error, indicate which samples are likely to be projected erroneously. To demonstrate its utility in modeling applications, MLPCA is also applied in the development of a model for chromatographic retention based on a data set which is only 80% complete. MLPCA can predict missing values and assign error estimates to these points. Finally, the problem of calibration transfer between instruments can be regarded as a missing data problem in which entire spectra are missing on the ‘slave’ instrument. Using NIR spectra obtained from two instruments, it is shown that spectra on the slave instrument can be predicted from a small subset of calibration transfer samples even if a different wavelength range is employed. Concentration prediction errors obtained by this approach were comparable to cross-validation errors obtained for the slave instrument when all spectra were available.
Keywords :
Exploratory data analysis , maximum likelihood , Missing data , Principal component analysis , Calibration transfer
Journal title :
Analytica Chimica Acta
Serial Year :
1997
Journal title :
Analytica Chimica Acta
Record number :
1025373
Link To Document :
بازگشت