DocumentCode :
991159
Title :
Large margin nearest neighbor classifiers
Author :
Domeniconi, Carlotta ; Gunopulos, Dimitrios ; Peng, Jing
Author_Institution :
Inf. & Software Eng. Dept., George Mason Univ., Fairfax, VA, USA
Volume :
16
Issue :
4
fYear :
2005
fDate :
7/1/2005 12:00:00 AM
Firstpage :
899
Lastpage :
909
Abstract :
The nearest neighbor technique is a simple and appealing approach to addressing classification problems. It relies on the assumption of locally constant class conditional probabilities. This assumption becomes invalid in high dimensions with a finite number of examples due to the curse of dimensionality. Severe bias can be introduced under these conditions when using the nearest neighbor rule. The employment of a locally adaptive metric becomes crucial in order to keep class conditional probabilities close to uniform, thereby minimizing the bias of estimates. We propose a technique that computes a locally flexible metric by means of support vector machines (SVMs). The decision function constructed by SVMs is used to determine the most discriminant direction in a neighborhood around the query. Such a direction provides a local feature weighting scheme. We formally show that our method increases the margin in the weighted space where classification takes place. Moreover, our method has the important advantage of online computational efficiency over competing locally adaptive techniques for nearest neighbor classification. We demonstrate the efficacy of our method using both real and simulated data.
Keywords :
pattern classification; probability; support vector machines; feature weighting scheme; locally constant class conditional probability; nearest neighbor classification; online computational efficiency; support vector machine; Computational efficiency; Computational modeling; Computer science; Employment; Engineering profession; Error analysis; Nearest neighbor searches; Software engineering; Support vector machine classification; Support vector machines; Feature relevance; margin; nearest neighbor classification; support vector machines (SVMs); Algorithms; Artificial Intelligence; Breast Neoplasms; Computer Simulation; Computing Methodologies; Decision Support Techniques; Diabetes Mellitus; Diagnosis, Computer-Assisted; Humans; Models, Biological; Models, Statistical; Numerical Analysis, Computer-Assisted; Pattern Recognition, Automated; Stochastic Processes;
fLanguage :
English
Journal_Title :
Neural Networks, IEEE Transactions on
Publisher :
ieee
ISSN :
1045-9227
Type :
jour
DOI :
10.1109/TNN.2005.849821
Filename :
1461432
Link To Document :
بازگشت