Decoding of text lines in grayscale document images

Author

Popat, Kris

Author_Institution

Xerox Palo Alto Res. Center, CA, USA

Volume

3

fYear

2001

fDate

2001

Firstpage

1513

Abstract

The Document Image Decoding (DID) framework for recognizing printed text in images has been shown in previous work to achieve extremely high recognition accuracy when its models are well matched to the data. To date, DID has been restricted to binary images, in part for computational reasons, and in part because binary scanning is widely available and often of sufficient spatial resolution to make the use of grayscale information unnecessary for reliable recognition. Advances in computer speed and memory, along with the emergence of low-cost digital still cameras and similar devices as alternatives to traditional scanners, motivates the extension of the DID formalism to the low-spatial-resolution grayscale and color domains. To do so requires substantially generalizing DID´s image-formation and degradation models. This paper lays out an approach and presents preliminary results on real data

Keywords

decoding; document image processing; image colour analysis; image recognition; image resolution; text analysis; DID framework; Document Image Decoding framework; color images; grayscale document images; image degradation; image formation; image recognition; low spatial resolution grayscale images; printed text recognition; text line decoding; Books; Decoding; Degradation; Digital cameras; Gray-scale; Image recognition; Printing; Software libraries; Spatial resolution; Text recognition;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on

Conference_Location

Salt Lake City, UT

ISSN

1520-6149

Print_ISBN

0-7803-7041-4

Type

conf

DOI

10.1109/ICASSP.2001.941219

Filename

941219