Title :
Automatic prototype extraction for adaptive OCR
Author :
Nagy, George ; Xu, Yihong
Author_Institution :
Dept. of Electr. Comput. & Syst. Eng., Rensselaer Polytech. Inst., Troy, NY, USA
Abstract :
A Bayesian method of isolating character bitmaps from paragraph-length samples of heavily degraded text images is demonstrated. The method requires a transcript of the text, but it is sufficiently robust to tolerate errors in transcripts obtained from multifont commercial OCR software. The resulting prototypes (labeled character images) are used to recognize additional text an the same document
Keywords :
Bayes methods; adaptive signal processing; document image processing; optical character recognition; Bayesian method; adaptive OCR; additional text recognition; automatic prototype extraction; character bitmap isolation; error tolerance; heavily degraded text images; labeled character images; multifont commercial OCR software; paragraph-length samples; text transcript; Bayesian methods; Character recognition; Degradation; Design engineering; Image recognition; Optical character recognition software; Prototypes; Robustness; Systems engineering and theory; Text recognition;
Conference_Titel :
Document Analysis and Recognition, 1997., Proceedings of the Fourth International Conference on
Conference_Location :
Ulm
Print_ISBN :
0-8186-7898-4
DOI :
10.1109/ICDAR.1997.619856