DocumentCode
1158171
Title
Scene Text Recognition Using Similarity and a Lexicon with Sparse Belief Propagation
Author
Weinman, Jerod J. ; Learned-Miller, Erik ; Hanson, Allen R.
Author_Institution
Dept. of Comput. Sci., Grinnell Coll., Grinnell, IA, USA
Volume
31
Issue
10
fYear
2009
Firstpage
1733
Lastpage
1746
Abstract
Scene text recognition (STR) is the recognition of text anywhere in the environment, such as signs and storefronts. Relative to document recognition, it is challenging because of font variability, minimal language context, and uncontrolled conditions. Much information available to solve this problem is frequently ignored or used sequentially. Similarity between character images is often overlooked as useful information. Because of language priors, a recognizer may assign different labels to identical characters. Directly comparing characters to each other, rather than only a model, helps ensure that similar instances receive the same label. Lexicons improve recognition accuracy but are used post hoc. We introduce a probabilistic model for STR that integrates similarity, language properties, and lexical decision. Inference is accelerated with sparse belief propagation, a bottom-up method for shortening messages by reducing the dependency between weakly supported hypotheses. By fusing information sources in one model, we eliminate unrecoverable errors that result from sequential processing, improving accuracy. In experimental results recognizing text from images of signs in outdoor scenes, incorporating similarity reduces character recognition error by 19 percent, the lexicon reduces word recognition error by 35 percent, and sparse belief propagation reduces the lexicon words considered by 99.9 percent with a 12X speedup and no loss in accuracy.
Keywords
Markov processes; character recognition; image recognition; STR; character image; document recognition; lexicon; scene text recognition; sparse belief propagation; Scene text recognition; belief propagation; conditional random fields; factor graphs; graphical models; language model; lexicon; optical character recognition; similarity; sparse belief propagation.;
fLanguage
English
Journal_Title
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher
ieee
ISSN
0162-8828
Type
jour
DOI
10.1109/TPAMI.2009.38
Filename
4782969
Link To Document