• DocumentCode
    3497899
  • Title

    OCR voting methods for recognizing low contrast printed documents

  • Author

    Marosi, I.

  • Author_Institution
    ScanSoft-Recognita, Inc., Burlington, MA
  • fYear
    2006
  • fDate
    27-28 April 2006
  • Lastpage
    115
  • Abstract
    Modern adaptive thresholding algorithms do their best to provide good quality binarized images. Unfortunately, it\´s hard to find a good compromise between the amount of background noise in the binary result and the amount of breaks or missing parts in the shape of characters if the original grey image has low contrast. In this paper, we describe some voting methods starting from an external "black box" voter, to a more deeply integrated "shape" voter that can be used to generate even better recognition results by running a voting OCR engine on two, differently thresholded images
  • Keywords
    document image processing; optical character recognition; OCR voting methods; adaptive thresholding algorithms; background noise; external black box voter; good quality binarized images; integrated shape voter; low contrast grey image; low contrast printed document recognition; optical character recognition; thresholded images; voting OCR engine; Background noise; Character recognition; Engines; Filtering algorithms; Image recognition; Noise shaping; Optical character recognition software; Shape; Text recognition; Voting;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Image Analysis for Libraries, 2006. DIAL '06. Second International Conference on
  • Conference_Location
    Lyon
  • Print_ISBN
    0-7695-2531-8
  • Type

    conf

  • DOI
    10.1109/DIAL.2006.28
  • Filename
    1612953