DocumentCode :
2152414
Title :
A new error bound for the classifier chosen by early stopping
Author :
Bax, Eric ; Cataltepe, Zehra ; Sill, Joe
Author_Institution :
California Inst. of Technol., Pasadena, CA, USA
Volume :
2
fYear :
1997
fDate :
20-22 Aug 1997
Firstpage :
811
Abstract :
Training with early stopping is the following process. Partition the in-sample data into training and validation sets. Begin with a random classifier g1. Use an iterative method to decrease the error rate on the training data. Record the classifier at each iteration, producing a series of snapshots g1,…gM . Evaluate the error rate of each snapshot over the validation data. Deliver a minimum validation error classifier, g*, as the result of training. The purpose of the paper is to develop a good probabilistic upper bound on the error rate of g* over out-of-sample (test) data. First, we use a validation oriented version of VC analysis (V.N. Vapnik, 1982; V.N. Vapnik and A. Chervonenkis, 1971) to develop a bound. Because of the nature of VC analysis, this initial bound is based on worst case assumptions about the rates of agreement among snapshots. In practice, though, successive snapshots are similar classifiers. We exploit this feature to develop a new bound. Then we test the bound on credit card data
Keywords :
credit transactions; error analysis; financial data processing; iterative methods; learning (artificial intelligence); pattern classification; probability; VC analysis; credit card data; early stopping; error bound; error rate; in-sample data; iterative method; machine learning; minimum validation error classifier; out-of-sample test data; probabilistic upper bound; random classifier; snapshots; successive snapshots; training; validation data; validation oriented version; validation sets; worst case assumptions; Credit cards; Error analysis; Iterative methods; Machine learning; Testing; Training data; Upper bound; Virtual colonoscopy; Virtual manufacturing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Communications, Computers and Signal Processing, 1997. 10 Years PACRIM 1987-1997 - Networking the Pacific Rim. 1997 IEEE Pacific Rim Conference on
Conference_Location :
Victoria, BC
Print_ISBN :
0-7803-3905-3
Type :
conf
DOI :
10.1109/PACRIM.1997.620383
Filename :
620383
Link To Document :
بازگشت