DocumentCode
2496977
Title
The effect of finite sample size on the holdout error probability estimator of homoscedastic multi-class Gaussian classification problems
Author
El Ayadi, Moataz ; Plataniotis, Konstantinos N.
Author_Institution
Edward S. Rogers Sr. Dept. of Electr. & Comput. Eng., Univ. of Toronto, Toronto, ON, Canada
fYear
2010
fDate
18-23 July 2010
Firstpage
1
Lastpage
8
Abstract
Consider a homoscedastic multi-class Gaussian classification problem where the class mean vectors and the common covariance matrix are not known to the practitioner. Rather, they are estimated from given sample vectors available for each class. In this paper, an empirical procedure for approximating the bias of the holdout estimator of the Bayesian error probability (BEP) is presented. Synthetic experiments demonstrate the accuracy of the proposed procedure and how it can be used for guiding the practitioner about the necessary amount of data vectors required to achieve a certain level of accuracy in the BEP estimation. When applied to real world classification problems from the UCI machine learning repository, the proposed procedure was successfully used to estimate the test error probability based on the training data only. Moreover, with a reasonable degree of accuracy, the proposed procedure predicted the test BEP when the amount of the training data in increased.
Keywords
Bayes methods; Gaussian processes; error statistics; learning (artificial intelligence); pattern classification; BEP estimation; Bayesian error probability; UCI machine learning repository; covariance matrix; error probability; finite sample size; holdout error probability estimator; homoscedastic multiclass Gaussian classification problems; training data; Accuracy; Covariance matrix; Error probability; Estimation; Mathematical model; Nickel; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks (IJCNN), The 2010 International Joint Conference on
Conference_Location
Barcelona
ISSN
1098-7576
Print_ISBN
978-1-4244-6916-1
Type
conf
DOI
10.1109/IJCNN.2010.5596888
Filename
5596888
Link To Document