• DocumentCode
    1063750
  • Title

    Minimal consistent set (MCS) identification for optimal nearest neighbor decision systems design

  • Author

    Dasarathy, Belur V.

  • Author_Institution
    Dynetics Inc., Huntsville, AL, USA
  • Volume
    24
  • Issue
    3
  • fYear
    1994
  • fDate
    3/1/1994 12:00:00 AM
  • Firstpage
    511
  • Lastpage
    517
  • Abstract
    A new approach is presented in this study for tackling the problem of high computational demands of nearest neighbor (NN) based decision systems. The approach, based on the concept of an optimal subset selection from a given training data set, derives a consistent subset which is aimed to be minimal in size. This minimal consistent subset (MCS) selection, in contrast to most of the other previous attempts of this nature, leads to an unique solution irrespective of the initial order of presentation of the data. Further, consistency property is assured at every iteration. Also, unlike under most prior approaches, the samples are selected here in the order of significance of their contribution for enabling the consistency property. This provides insight into the relative significance of the samples in the training set. Experimental results based on a number of independent training and test data sets are presented and discussed to illustrate the methodology and bring to focus its benefits. These results show that the nearest neighbor decision system performance suffers little degradation when the given large training set is replaced by its much smaller MCS in the operational phase of testing with an independent test set. A direct experimental comparison with a prior approach is also furnished to further strengthen the case for the new methodology
  • Keywords
    decision theory; pattern recognition; set theory; computational demands; consistency property; minimal consistent set identification; optimal nearest neighbor decision systems; training data set; Books; Computational efficiency; Degradation; Nearest neighbor searches; Neural networks; Pattern recognition; Prototypes; System performance; System testing; Training data;
  • fLanguage
    English
  • Journal_Title
    Systems, Man and Cybernetics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9472
  • Type

    jour

  • DOI
    10.1109/21.278999
  • Filename
    278999