• DocumentCode
    3851104
  • Title

    On dimensionality, sample size, and classification error of nonparametric linear classification algorithms

  • Author

    S. Raudys

  • Author_Institution
    Inst. of Math. & Inf., Vilnius Univ., Lithuania
  • Volume
    19
  • Issue
    6
  • fYear
    1997
  • Firstpage
    667
  • Lastpage
    671
  • Abstract
    This paper compares two nonparametric linear classification algorithms $the zero empirical error classifier and the maximum margin classifier - with parametric linear classifiers designed to classify multivariate Gaussian populations. Formulae and a table for the mean expected probability of misclassification MEP/sub N/ are presented. They show that the classification error is mainly determined by N/p, a learning-set size/dimensionality ratio. However, the influences of learning-set size on the generalization error of parametric and nonparametric linear classifiers are quite different. Under certain conditions the nonparametric approach allows us to obtain reliable rules, even in cases where the number of features is larger than the number of training vectors.
  • Keywords
    "Classification algorithms","Euclidean distance","Machine learning","Algorithm design and analysis","Pattern recognition","Support vector machines","Support vector machine classification","Estimation error","Information theory","Physics"
  • Journal_Title
    IEEE Transactions on Pattern Analysis and Machine Intelligence
  • Publisher
    ieee
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/34.601254
  • Filename
    601254