DocumentCode
3851104
Title
On dimensionality, sample size, and classification error of nonparametric linear classification algorithms
Author
S. Raudys
Author_Institution
Inst. of Math. & Inf., Vilnius Univ., Lithuania
Volume
19
Issue
6
fYear
1997
Firstpage
667
Lastpage
671
Abstract
This paper compares two nonparametric linear classification algorithms $the zero empirical error classifier and the maximum margin classifier - with parametric linear classifiers designed to classify multivariate Gaussian populations. Formulae and a table for the mean expected probability of misclassification MEP/sub N/ are presented. They show that the classification error is mainly determined by N/p, a learning-set size/dimensionality ratio. However, the influences of learning-set size on the generalization error of parametric and nonparametric linear classifiers are quite different. Under certain conditions the nonparametric approach allows us to obtain reliable rules, even in cases where the number of features is larger than the number of training vectors.
Keywords
"Classification algorithms","Euclidean distance","Machine learning","Algorithm design and analysis","Pattern recognition","Support vector machines","Support vector machine classification","Estimation error","Information theory","Physics"
Journal_Title
IEEE Transactions on Pattern Analysis and Machine Intelligence
Publisher
ieee
ISSN
0162-8828
Type
jour
DOI
10.1109/34.601254
Filename
601254
Link To Document