DocumentCode :
759502
Title :
Multiscale Classification Using Nearest Neighbor Density Estimates
Author :
Ghosh, Anil K. ; Chaudhuri, Probal ; Murthy, C.A.
Author_Institution :
Math. Sci. Inst., Australian Nat. Univ., Canberra, ACT
Volume :
36
Issue :
5
fYear :
2006
Firstpage :
1139
Lastpage :
1148
Abstract :
Density estimates based on k-nearest neighbors have useful applications in nonparametric discriminant analysis. In classification problems, optimal values of k are usually estimated by minimizing the cross-validated misclassification rates. However, these cross-validation techniques allow only one value of k for each population density estimate, while in a classification problem, the optimum value of k for a class may also depend on its competing population densities. Further, it is computationally difficult to minimize the cross-validated error rate when there are several competing populations. Moreover, in addition to depending on the entire training data set, a good choice of k should also depend on the specific observation to be classified. Therefore, instead of using a single value of k for each population density estimate, it is more useful in practice to consider the results for multiple values of k to arrive at the final decision. This paper presents one such approach along with a graphical device, which gives more information about classification results for various choices of k and the related statistical uncertainties present there. The utility of this proposed methodology has been illustrated using some benchmark data sets
Keywords :
learning (artificial intelligence); pattern classification; probability; statistical analysis; cross-validation technique; multiscale classification; nearest neighbor density estimates; nonparametric discriminant analysis; population density estimate; training data set; Accuracy; Australia; Density functional theory; Error analysis; Mathematics; Nearest neighbor searches; Probability; Size control; Training data; Uncertainty; Bootstrap; cross validation; misclassification rate; multiscale analysis; posterior probability; weighted averaging;
fLanguage :
English
Journal_Title :
Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on
Publisher :
ieee
ISSN :
1083-4419
Type :
jour
DOI :
10.1109/TSMCB.2006.873186
Filename :
1703655
Link To Document :
بازگشت