• DocumentCode
    2011085
  • Title

    Combining Multi-scale Character Recognition and Linguistic Knowledge for Natural Scene Text OCR

  • Author

    Elagouni, Khaoula ; Garcia, Christophe ; Mamalet, Franck ; Sébillot, Pascale

  • Author_Institution
    Orange Labs. R&D, Cesson-Sevigne, France
  • fYear
    2012
  • fDate
    27-29 March 2012
  • Firstpage
    120
  • Lastpage
    124
  • Abstract
    Understanding text captured in real-world scenes is a challenging problem in the field of visual pattern recognition and continues to generate a significant interest in the OCR (Optical Character Recognition) community. This paper proposes a novel method to recognize scene texts avoiding the conventional character segmentation step. The idea is to scan the text image with multi-scale windows and apply a robust recognition model, relying on a neural classification approach, to every window in order to recognize valid characters and identify non valid ones. Recognition results are represented as a graph model in order to determine the best sequence of characters. Some linguistic knowledge is also incorporated to remove errors due to recognition confusions. The designed method is evaluated on the ICDAR 2003 database of scene text images and outperforms state-of-the-art approaches.
  • Keywords
    computational linguistics; graph theory; natural scenes; neural nets; optical character recognition; text detection; ICDAR 2003 database; OCR community; character segmentation step; error removal; graph model; linguistic knowledge; multiscale character recognition; multiscale windows; natural scene text OCR; neural classification approach; optical character recognition; real-world scenes; robust recognition model; scene text recognition; text image scanning; visual pattern recognition; Character recognition; Image color analysis; Image recognition; Image segmentation; Optical character recognition software; Robustness; Text recognition; convolutional neural networks; language model; multi-scale character recognition; scene text recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis Systems (DAS), 2012 10th IAPR International Workshop on
  • Conference_Location
    Gold Cost, QLD
  • Print_ISBN
    978-1-4673-0868-7
  • Type

    conf

  • DOI
    10.1109/DAS.2012.26
  • Filename
    6195347