DocumentCode
3497899
Title
OCR voting methods for recognizing low contrast printed documents
Author
Marosi, I.
Author_Institution
ScanSoft-Recognita, Inc., Burlington, MA
fYear
2006
fDate
27-28 April 2006
Lastpage
115
Abstract
Modern adaptive thresholding algorithms do their best to provide good quality binarized images. Unfortunately, it\´s hard to find a good compromise between the amount of background noise in the binary result and the amount of breaks or missing parts in the shape of characters if the original grey image has low contrast. In this paper, we describe some voting methods starting from an external "black box" voter, to a more deeply integrated "shape" voter that can be used to generate even better recognition results by running a voting OCR engine on two, differently thresholded images
Keywords
document image processing; optical character recognition; OCR voting methods; adaptive thresholding algorithms; background noise; external black box voter; good quality binarized images; integrated shape voter; low contrast grey image; low contrast printed document recognition; optical character recognition; thresholded images; voting OCR engine; Background noise; Character recognition; Engines; Filtering algorithms; Image recognition; Noise shaping; Optical character recognition software; Shape; Text recognition; Voting;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Image Analysis for Libraries, 2006. DIAL '06. Second International Conference on
Conference_Location
Lyon
Print_ISBN
0-7695-2531-8
Type
conf
DOI
10.1109/DIAL.2006.28
Filename
1612953
Link To Document