• Title of article

    Statistical computation of feature weighting schemes through data estimation for nearest neighbor classifiers

  • Author/Authors

    Sáez, José A.; Derrac, Joaquín; Luengo, Julián; Herrera, Francisco

  • Issue Information
    Journal with serial number, year 2014
  • Pages
    8
  • From page
    3941
  • To page
    3948
  • Abstract
    The Nearest Neighbor rule is one of the most successful classifiers in machine learning. However, it is very sensitive to noisy, redundant and irrelevant features, which may cause its performance to deteriorate. Feature weighting methods try to overcome this problem by incorporating weights into the similarity function to increase or reduce the importance of each feature, according to how they behave in the classification task. This paper proposes a new feature weighting classifier, in which the computation of the weights is based on a novel idea combining imputation methods – used to estimate a new distribution of values for each feature based on the rest of the data – and the Kolmogorov–Smirnov nonparametric statistical test to measure the changes between the original and imputed distribution of values. This proposal is compared with classic and recent feature weighting methods. The experimental results show that our feature weighting scheme is very resilient to the choice of imputation method and is an effective way of improving the performance of the Nearest Neighbor classifier, outperforming the rest of the classifiers considered in the comparisons.
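
    The idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation, and several details are assumptions: the imputation step is a simple leave-one-out k-nearest-neighbor mean (the paper considers several imputation methods), the mapping from the Kolmogorov–Smirnov statistic to a weight is a plain normalization to sum 1 (the abstract does not state the actual formula or its direction), and the function names `ks_statistic`, `impute_feature`, `feature_weights` and `weighted_distance` are hypothetical.

```python
import bisect
import math


def ks_statistic(a, b):
    """Two-sample Kolmogorov-Smirnov statistic: largest gap between the
    empirical CDFs of samples a and b."""
    sa, sb = sorted(a), sorted(b)
    d = 0.0
    for x in sa + sb:
        fa = bisect.bisect_right(sa, x) / len(sa)
        fb = bisect.bisect_right(sb, x) / len(sb)
        d = max(d, abs(fa - fb))
    return d


def impute_feature(rows, j, k=3):
    """Re-estimate feature j of every row from the rest of the data.

    Assumption: each value is replaced by the mean of feature j over the
    k nearest other rows, with distances computed on all features except j.
    """
    imputed = []
    for i, r in enumerate(rows):
        dists = []
        for m, s in enumerate(rows):
            if m == i:
                continue
            d = sum((r[f] - s[f]) ** 2 for f in range(len(r)) if f != j)
            dists.append((d, s[j]))
        dists.sort(key=lambda t: t[0])
        nearest = dists[:k]
        imputed.append(sum(v for _, v in nearest) / len(nearest))
    return imputed


def feature_weights(rows, k=3):
    """One weight per feature from the KS statistic between the original
    and imputed value distributions.

    Assumption: weights are the statistics normalized to sum to 1.
    """
    n_features = len(rows[0])
    stats = []
    for j in range(n_features):
        original = [r[j] for r in rows]
        stats.append(ks_statistic(original, impute_feature(rows, j, k)))
    total = sum(stats)
    if total == 0:
        return [1.0 / n_features] * n_features
    return [s / total for s in stats]


def weighted_distance(x, y, w):
    """Weighted Euclidean distance used inside a nearest neighbor rule."""
    return math.sqrt(sum(wi * (xi - yi) ** 2 for wi, xi, yi in zip(w, x, y)))
```

    The weights returned by `feature_weights` would then be passed to `weighted_distance` when searching for the nearest neighbor of a query instance; a different imputation method could be swapped in by replacing `impute_feature` alone.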
  • Keywords
    imputation methods, nearest neighbor, classification, feature weighting
  • Journal title
    PATTERN RECOGNITION
  • Serial Year
    2014
  • Record number

    1736719