Title of article :
Statistical computation of feature weighting schemes through data estimation for nearest neighbor classifiers
Author/Authors :
Sلez، نويسنده , , José A. and Derrac، نويسنده , , Joaquيn and Luengo، نويسنده , , Juliلn and Herrera، نويسنده , , Francisco، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2014
Abstract :
The Nearest Neighbor rule is one of the most successful classifiers in machine learning. However, it is very sensitive to noisy, redundant and irrelevant features, which may cause its performance to deteriorate. Feature weighting methods try to overcome this problem by incorporating weights into the similarity function to increase or reduce the importance of each feature, according to how they behave in the classification task. This paper proposes a new feature weighting classifier, in which the computation of the weights is based on a novel idea combining imputation methods – used to estimate a new distribution of values for each feature based on the rest of the data – and the Kolmogorov–Smirnov nonparametric statistical test to measure the changes between the original and imputed distribution of values. This proposal is compared with classic and recent feature weighting methods. The experimental results show that our feature weighting scheme is very resilient to the choice of imputation method and is an effective way of improving the performance of the Nearest Neighbor classifier, outperforming the rest of the classifiers considered in the comparisons.
Keywords :
imputation methods , nearest neighbor , Classification , Feature weighting
Journal title :
PATTERN RECOGNITION
Journal title :
PATTERN RECOGNITION