DocumentCode :
2910072
Title :
A PRESS statistic for two-block partial least squares regression
Author :
McWilliams, Brian ; Montana, Giovanni
Author_Institution :
Dept. of Math., Imperial Coll. London, London, UK
fYear :
2010
fDate :
8-10 Sept. 2010
Firstpage :
1
Lastpage :
6
Abstract :
Predictive modelling of multivariate data where both the covariates and responses are high-dimensional is becoming an increasingly popular task in many data mining applications. Partial Least Squares (PLS) regression often turns out to be a useful model in these situations since it performs dimensionality reduction by assuming the existence of a small number of latent factors that may explain the linear dependence between input and output. In practice, the number of latent factors to be retained, which controls the complexity of the model and its predictive ability, has to be carefully selected. Typically this is done by cross validating a performance measure, such as the predictive error. Although cross validation works well in many practical settings, it can be computationally expensive. Various extensions to PLS have also been proposed for regularising the PLS solution and performing simultaneous dimensionality reduction and variable selection, but these come at the expense of additional complexity parameters that also need to be tuned by cross-validation. In this paper we derive a computationally efficient alternative to leave-one-out cross validation (LOOCV), a predicted sum of squares (PRESS) statistic for two-block PLS. We show that the PRESS is nearly identical to LOOCV but has the computational expense of only a single PLS model fit. Examples of the PRESS for selecting the number of latent factors and regularisation parameters are provided.
Keywords :
data mining; least squares approximations; regression analysis; PRESS statistic; cross validation; data mining; leave-one-out cross validation; multivariate data; predictive modelling; two block partial least squares regression; Approximation error; Computational modeling; Covariance matrix; Data models; Predictive models; Presses; Sensitivity;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence (UKCI), 2010 UK Workshop on
Conference_Location :
Colchester
Print_ISBN :
978-1-4244-8774-5
Electronic_ISBN :
978-1-4244-8773-8
Type :
conf
DOI :
10.1109/UKCI.2010.5625583
Filename :
5625583
Link To Document :
بازگشت