Title of article :
Detecting “bad” regression models: multicriteria fitness functions in regression analysis Original Research Article
Author/Authors :
Roberto Todeschini، نويسنده , , Viviana Consonni، نويسنده , , Andrea Mauri، نويسنده , , Manuela Pavan، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2004
Pages :
10
From page :
199
To page :
208
Abstract :
Regression models with good fitting but no predictive ability are sometimes chance correlations and often show some pathological features such as multicollinearity, overfitting, and inclusion of noisy/spurious variables. This problem is well known and of the utmost importance. The present paper proposes some criteria that are to be fulfilled as conditions for model acceptability, the aim being to recognize linear regression models with pathology. These criteria have been thought of in order to face the following problems: • model instability due to outliers and influential objects; • predictor multicollinearity; • redundancy in explanatory variables; • overfitting due to chance factors. A multicriteria fitness function based on the maximization of the Q2 statistics under a set of tests is proposed here. This new fitness function can also be used in model searching by variable selection approaches in order to obtain a final optimal population of models. Computations on the Selwood data set are reported to illustrate the use of this multicriteria fitness function in model searching.
Keywords :
Regression analysis , Multicriteria decision making , variable selection , Selwood data set
Journal title :
Analytica Chimica Acta
Serial Year :
2004
Journal title :
Analytica Chimica Acta
Record number :
1034213
Link To Document :
بازگشت