DocumentCode
1749878
Title
Decoding of text lines in grayscale document images
Author
Popat, Kris
Author_Institution
Xerox Palo Alto Res. Center, CA, USA
Volume
3
fYear
2001
fDate
2001
Firstpage
1513
Abstract
The Document Image Decoding (DID) framework for recognizing printed text in images has been shown in previous work to achieve extremely high recognition accuracy when its models are well matched to the data. To date, DID has been restricted to binary images, in part for computational reasons, and in part because binary scanning is widely available and often of sufficient spatial resolution to make the use of grayscale information unnecessary for reliable recognition. Advances in computer speed and memory, along with the emergence of low-cost digital still cameras and similar devices as alternatives to traditional scanners, motivates the extension of the DID formalism to the low-spatial-resolution grayscale and color domains. To do so requires substantially generalizing DID´s image-formation and degradation models. This paper lays out an approach and presents preliminary results on real data
Keywords
decoding; document image processing; image colour analysis; image recognition; image resolution; text analysis; DID framework; Document Image Decoding framework; color images; grayscale document images; image degradation; image formation; image recognition; low spatial resolution grayscale images; printed text recognition; text line decoding; Books; Decoding; Degradation; Digital cameras; Gray-scale; Image recognition; Printing; Software libraries; Spatial resolution; Text recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location
Salt Lake City, UT
ISSN
1520-6149
Print_ISBN
0-7803-7041-4
Type
conf
DOI
10.1109/ICASSP.2001.941219
Filename
941219
Link To Document