DocumentCode :
1115388
Title :
Capacity and Error Estimates for Boolean Classifiers with Limited Complexity
Author :
Pearl, Judea
Author_Institution :
SENIOR MEMBER, IEEE, School of Engineering and Applied Science, University of California, Los Angeles, CA 90024.
Issue :
4
fYear :
1979
Firstpage :
350
Lastpage :
356
Abstract :
This paper extends the notions of capacity and distribution-free error estimation to nonlinear Boolean classifiers on patterns with binary-valued features. We establish quantitative relationships between the dimensionality of the feature vectors (d), the combinational complexity of the decision rule (c), the number of samples in the training set (n), and the classification performance of the resulting classifier. Our results state that the discriminating capacity of Boolean classifiers is given by the product dc, and the probability of ambiguous generalization is asymptotically given by (n/dc-1)-1 0(log d)/d) for large d, and n=0(dc). In addition we show that if a fraction ¿ of the training samples is misclassified then the probability of error (¿) in subsequent samples satisfies P(|¿-¿| ¿) m=<2.773 exp (dc-e2n/8) for all distributions, regardless of how the classifier was discovered.
Keywords :
Bayesian methods; Capacity planning; Error analysis; Pattern classification; Pattern recognition; Size measurement; Vectors; Boolean classifiers; capacity; dimensionality; error estimation; measurement complexity; nonparametric classification; sample size;
fLanguage :
English
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher :
ieee
ISSN :
0162-8828
Type :
jour
DOI :
10.1109/TPAMI.1979.4766943
Filename :
4766943
Link To Document :
بازگشت