Title of article
Likelihood inference in nearest-neighbour classification models
Author/Authors
Adams, Niall M. (author); Holmes, Christopher C. (author)
Issue Information
Journal with serial number, year 2002
From page
99
To page
0
Abstract
Traditionally the neighbourhood size k in the k-nearest-neighbour algorithm is either fixed at the first nearest neighbour or is selected on the basis of a cross-validation study. In this paper we present an alternative approach that develops the k-nearest-neighbour algorithm using likelihood-based inference. Our method takes the form of a generalised linear regression on a set of k-nearest-neighbour autocovariates. By defining the k-nearest-neighbour algorithm in this way we are able to extend the method to accommodate the original predictor variables as possible linear effects as well as allowing for the inclusion of multiple nearest-neighbour terms. The choice of the final model proceeds via a stepwise regression procedure. It is shown that our method incorporates a conventional generalised linear model and a conventional k-nearest-neighbour algorithm as special cases. Empirical results suggest that the method outperforms the standard k-nearest-neighbour method in terms of misclassification rate on a wide variety of datasets.
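The idea of a generalised linear regression on nearest-neighbour autocovariates can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that the autocovariate for a point is the proportion of class-1 labels among its k nearest other points (Euclidean distance, the point itself excluded), and it fits a logistic regression by plain gradient ascent on the Bernoulli log-likelihood. The function names, the toy data, and the step-size settings are all hypothetical, and the stepwise model selection described in the abstract is omitted.

```python
import math


def knn_autocovariate(X, y, k):
    """Proportion of class-1 labels among each point's k nearest other points.

    Assumption for illustration: Euclidean distance, point itself excluded.
    """
    feats = []
    for i, xi in enumerate(X):
        dists = sorted((math.dist(xi, xj), j) for j, xj in enumerate(X) if j != i)
        neighbours = [j for _, j in dists[:k]]
        feats.append(sum(y[j] for j in neighbours) / k)
    return feats


def predict_proba(beta, row):
    """Logistic link: P(y = 1 | row) for coefficient vector beta."""
    return 1.0 / (1.0 + math.exp(-sum(b * v for b, v in zip(beta, row))))


def fit_logistic(rows, y, steps=3000, lr=0.5):
    """Gradient ascent on the mean Bernoulli log-likelihood (a stand-in for
    the usual iteratively reweighted least squares fit of a GLM)."""
    beta = [0.0] * len(rows[0])
    n = len(rows)
    for _ in range(steps):
        grad = [0.0] * len(beta)
        for r, yi in zip(rows, y):
            p = predict_proba(beta, r)
            for d, v in enumerate(r):
                grad[d] += (yi - p) * v
        beta = [b + lr * g / n for b, g in zip(beta, grad)]
    return beta


# Hypothetical toy data: two tight clusters, each with one mislabelled outlier.
X = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (0.1, 0.1), (0.5, 0.5),
     (3.0, 3.0), (3.1, 3.0), (3.0, 3.1), (3.1, 3.1), (2.5, 2.5)]
y = [0, 0, 0, 0, 1, 1, 1, 1, 1, 0]

k = 3
autocov = knn_autocovariate(X, y, k)

# Design matrix: intercept plus one k-nearest-neighbour autocovariate.
# Original predictors or further autocovariates (other k) could be appended.
rows = [[1.0, a] for a in autocov]
beta = fit_logistic(rows, y)

print("autocovariates:", autocov)
print("coefficients:", beta)
```

In this toy set the autocovariate takes only the values 0 and 1, so the fitted probabilities approach the empirical class-1 proportions in each group (0.2 and 0.8). Appending the original predictors as extra columns of `rows`, or autocovariates for several values of k, would mirror the extensions the abstract describes, with likelihood-based comparison deciding which terms to keep.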
Keywords
Hyperbolic distribution, Generalised inverse Gaussian distribution, Modified Bessel function of the third kind, Positive skewness
Journal title
Biometrika
Serial Year
2002
Record number
71724