DocumentCode
3487434
Title
Improving Open-Vocabulary Scene Text Recognition
Author
Feild, Jacqueline L. ; Learned-Miller, Erik G.
Author_Institution
Dept. of Comput. Sci., Univ. of Massachusetts Amherst, Amherst, MA, USA
fYear
2013
fDate
25-28 Aug. 2013
Firstpage
604
Lastpage
608
Abstract
This paper presents a system for open-vocabulary text recognition in images of natural scenes. First, we describe a novel technique for text segmentation that models smooth color changes across images. We combine this with a recognition component based on a conditional random field with histogram of oriented gradients descriptors and incorporate language information from a lexicon to improve recognition performance. Many existing techniques for this problem use language information from a standard lexicon, but these may not include many of the words found in images of the environment, such as storefront signs and street signs. We avoid this limitation by incorporating language information from a large web-based lexicon of around 13.5 million words. This lexicon contains words encountered during a crawl of the web, so it is likely to contain proper nouns, like business names and street names. We show that our text segmentation method allows for better recognition performance than the current state-of-the-art text segmentation method. We also evaluate this full system on two standard data sets, ICDAR 2003 and ICDAR 2011, and show an increase in word recognition performance compared to the current state-of-the-art methods.
Keywords
Internet; gradient methods; image colour analysis; image segmentation; natural language processing; text analysis; vocabulary; Web based lexicon; Web crawler; business names; conditional random field; histogram of oriented gradients descriptors; language information; natural scenes; open vocabulary scene text recognition; proper nouns; smooth color models; standard lexicon; storefront signs; street names; street signs; text segmentation; Accuracy; Error correction; Image color analysis; Image recognition; Image segmentation; Standards; Text recognition; scene text recognition; scene text segmentation;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
Conference_Location
Washington, DC
ISSN
1520-5363
Type
conf
DOI
10.1109/ICDAR.2013.125
Filename
6628690
Link To Document