• DocumentCode
    3487434
  • Title

    Improving Open-Vocabulary Scene Text Recognition

  • Author

    Feild, Jacqueline L. ; Learned-Miller, Erik G.

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Massachusetts Amherst, Amherst, MA, USA
  • fYear
    2013
  • fDate
    25-28 Aug. 2013
  • Firstpage
    604
  • Lastpage
    608
  • Abstract
    This paper presents a system for open-vocabulary text recognition in images of natural scenes. First, we describe a novel technique for text segmentation that models smooth color changes across images. We combine this with a recognition component based on a conditional random field with histogram of oriented gradients descriptors and incorporate language information from a lexicon to improve recognition performance. Many existing techniques for this problem use language information from a standard lexicon, but these may not include many of the words found in images of the environment, such as storefront signs and street signs. We avoid this limitation by incorporating language information from a large web-based lexicon of around 13.5 million words. This lexicon contains words encountered during a crawl of the web, so it is likely to contain proper nouns, like business names and street names. We show that our text segmentation method allows for better recognition performance than the current state-of-the-art text segmentation method. We also evaluate this full system on two standard data sets, ICDAR 2003 and ICDAR 2011, and show an increase in word recognition performance compared to the current state-of-the-art methods.
  • Keywords
    Internet; gradient methods; image colour analysis; image segmentation; natural language processing; text analysis; vocabulary; Web based lexicon; Web crawler; business names; conditional random field; histogram of oriented gradients descriptors; language information; natural scenes; open vocabulary scene text recognition; proper nouns; smooth color models; standard lexicon; storefront signs; street names; street signs; text segmentation; Accuracy; Error correction; Image color analysis; Image recognition; Image segmentation; Standards; Text recognition; scene text recognition; scene text segmentation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
  • Conference_Location
    Washington, DC
  • ISSN
    1520-5363
  • Type

    conf

  • DOI
    10.1109/ICDAR.2013.125
  • Filename
    6628690