DocumentCode :
1115125
Title :
Document image decoding using Markov source models
Author :
Kopec, Gary E. ; Chou, Philip A.
Author_Institution :
Inf. Sci. & Technol. Lab., Xerox Palo Alto Res. Center, CA, USA
Volume :
16
Issue :
6
fYear :
1994
fDate :
6/1/1994 12:00:00 AM
Firstpage :
602
Lastpage :
617
Abstract :
Document image decoding (DID) is a communication theory approach to document image recognition. In DID, a document recognition problem is viewed as consisting of three elements: an image generator, a noisy channel and an image decoder. A document image generator is a Markov source (stochastic finite-state automaton) that combines a message source with an imager. The message source produces a string of symbols, or text, that contains the information to be transmitted. The imager is modeled as a finite-state transducer that converts the 1D message string into an ideal 2D bitmap. The channel transforms the ideal image into a noisy observed image. The decoder estimates the message, given the observed image, by finding the a posteriori most probable path through the combined source and channel models using a Viterbi-like dynamic programming algorithm. The proposed approach is illustrated on the problem of decoding scanned telephone yellow pages to extract names and numbers from the listings. A finite-state model for yellow page columns was constructed and used to decode a database of scanned column images containing about 1100 individual listings
Keywords :
document image processing; dynamic programming; hidden Markov models; image coding; 1D message string; 2D bitmap; Markov source models; Viterbi-like dynamic programming; channel models; communication theory; decoder; document image decoding; document image recognition; finite state model; message source; stochastic finite state automaton; Automata; Decoding; Dynamic programming; Heuristic algorithms; Image converters; Image generation; Image recognition; Noise generators; Stochastic processes; Transducers;
fLanguage :
English
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher :
ieee
ISSN :
0162-8828
Type :
jour
DOI :
10.1109/34.295905
Filename :
295905
Link To Document :
بازگشت