Title of article
Feature selection and weighting by nearest neighbor ensembles
Author/Authors
Gertheiss, Jan and Tutz, Gerhard
Issue Information
Biannual, consecutive issue number, year 2009
Pages
9
From page
30
To page
38
Abstract
In statistical discrimination, nearest neighbor methods are a well-known, simple, yet successful nonparametric classification tool. As the number of predictors increases, however, predictive power normally deteriorates. In general, if some covariates are assumed to be noise variables, variable selection is a promising approach. The paper's main focus is on the development and evaluation of a nearest neighbor ensemble with implicit variable selection. In contrast to other nearest neighbor approaches, we are not primarily interested in classification but in estimating the (posterior) class probabilities. In simulation studies and on real-world data, the proposed nearest neighbor ensemble is compared to an extended forward/backward variable selection procedure for nearest neighbor classifiers and to some alternative well-established classification tools (that offer probability estimates as well). Despite its simple structure, the proposed method performs quite well, especially if relevant covariates can be separated from noise variables. Another advantage of the presented ensemble is the easy identification of interactions, which are usually hard to detect. Thus, not simply variable selection but rather a kind of feature selection is performed.
Keywords
Nearest neighbor methods , variable selection , Classification , Ensemble methods
Journal title
Chemometrics and Intelligent Laboratory Systems
Serial Year
2009
Record number
1489574