Title :
Classification using a hierarchical Bayesian approach
Author :
Mathis, Charles ; Breuel, Thomas
Author_Institution :
Document Image Decoding Group, Xerox Palo Alto Res. Center, CA, USA
Abstract :
A key problem faced by classifiers is coping with styles not represented in the training set. We present an application of hierarchical Bayesian methods to the problem of recognizing degraded printed characters in a variety of fonts. The proposed method works by using training data of various styles and classes to compute prior distributions on the parameters for the class conditional distributions. For classification, the parameters for the actual class conditional distributions are fitted using an EM algorithm. The advantage of hierarchical Bayesian methods is motivated with a theoretical example. Severalfold increases in classification performance relative to style-oblivious and style-conscious are demonstrated on a multifont OCR task.
Keywords :
Bayes methods; approximation theory; learning (artificial intelligence); optical character recognition; optimisation; pattern classification; EM algorithm; OCR; hierarchical Bayesian methods; maximum likelihood approximation; pattern classification; printed character recognition; style-conscious classification; training set; Bayesian methods; Character recognition; Context modeling; Decoding; Degradation; Distributed computing; Optical character recognition software; Optical coupling; Shape; Training data;
Conference_Titel :
Pattern Recognition, 2002. Proceedings. 16th International Conference on
Print_ISBN :
0-7695-1695-X
DOI :
10.1109/ICPR.2002.1047410