• DocumentCode
    991159
  • Title

    Large margin nearest neighbor classifiers

  • Author

    Domeniconi, Carlotta ; Gunopulos, Dimitrios ; Peng, Jing

  • Author_Institution
    Inf. & Software Eng. Dept., George Mason Univ., Fairfax, VA, USA
  • Volume
    16
  • Issue
    4
  • fYear
    2005
  • fDate
    7/1/2005 12:00:00 AM
  • Firstpage
    899
  • Lastpage
    909
  • Abstract
    The nearest neighbor technique is a simple and appealing approach to addressing classification problems. It relies on the assumption of locally constant class conditional probabilities. This assumption becomes invalid in high dimensions with a finite number of examples due to the curse of dimensionality. Severe bias can be introduced under these conditions when using the nearest neighbor rule. The employment of a locally adaptive metric becomes crucial in order to keep class conditional probabilities close to uniform, thereby minimizing the bias of estimates. We propose a technique that computes a locally flexible metric by means of support vector machines (SVMs). The decision function constructed by SVMs is used to determine the most discriminant direction in a neighborhood around the query. Such a direction provides a local feature weighting scheme. We formally show that our method increases the margin in the weighted space where classification takes place. Moreover, our method has the important advantage of online computational efficiency over competing locally adaptive techniques for nearest neighbor classification. We demonstrate the efficacy of our method using both real and simulated data.
  • Keywords
    pattern classification; probability; support vector machines; feature weighting scheme; locally constant class conditional probability; nearest neighbor classification; online computational efficiency; support vector machine; Computational efficiency; Computational modeling; Computer science; Employment; Engineering profession; Error analysis; Nearest neighbor searches; Software engineering; Support vector machine classification; Support vector machines; Feature relevance; margin; nearest neighbor classification; support vector machines (SVMs); Algorithms; Artificial Intelligence; Breast Neoplasms; Computer Simulation; Computing Methodologies; Decision Support Techniques; Diabetes Mellitus; Diagnosis, Computer-Assisted; Humans; Models, Biological; Models, Statistical; Numerical Analysis, Computer-Assisted; Pattern Recognition, Automated; Stochastic Processes;
  • fLanguage
    English
  • Journal_Title
    Neural Networks, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1045-9227
  • Type

    jour

  • DOI
    10.1109/TNN.2005.849821
  • Filename
    1461432