• DocumentCode
    3485740
  • Title

    Image Binarization for End-to-End Text Understanding in Natural Images

  • Author

    Milyaev, Sergey ; Barinova, Olga ; Novikova, Tatiana ; Kohli, Pushmeet ; Lempitsky, Victor

  • Author_Institution
    Lomonosov Moscow State Univ., Moscow, Russia
  • fYear
    2013
  • fDate
    25-28 Aug. 2013
  • Firstpage
    128
  • Lastpage
    132
  • Abstract
    While modern off-the-shelf OCR engines show particularly high accuracy on scanned text, text detection and recognition in natural images still remains a challenging problem. Here, we demonstrate that OCR engines can still perform well on this harder task as long as appropriate image binarization is applied to input photographs. For such binarization, we systematically evaluate the performance of 12 binarization methods as well as of a new binarization algorithm that we propose here. Our evaluation includes different metrics and uses established natural image text recognition benchmarks (ICDAR 2003 and ICDAR 2011). Our main finding is thus the fact that image binarization methods combined with additional filtering of generated connected components and off-the-shelf OCR engines can achieve state-of-the-art performance for end-to-end text understanding in natural images.
  • Keywords
    image recognition; natural scenes; text analysis; text detection; ICDAR 2003; ICDAR 2011; connected component filtering; end-to-end text understanding; image binarization methods; input photographs; natural image text recognition benchmarks; natural images; off-the-shelf OCR engines; performance evaluation; text detection; text recognition; text scanning; Accuracy; Engines; Image recognition; Image segmentation; Optical character recognition software; Pipelines; Text recognition; natural scene binarization; text localization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
  • Conference_Location
    Washington, DC
  • ISSN
    1520-5363
  • Type

    conf

  • DOI
    10.1109/ICDAR.2013.33
  • Filename
    6628598