Title :
Image Binarization for End-to-End Text Understanding in Natural Images
Author :
Milyaev, Sergey ; Barinova, Olga ; Novikova, Tatiana ; Kohli, Pushmeet ; Lempitsky, Victor
Author_Institution :
Lomonosov Moscow State Univ., Moscow, Russia
Abstract :
While modern off-the-shelf OCR engines show particularly high accuracy on scanned text, text detection and recognition in natural images still remains a challenging problem. Here, we demonstrate that OCR engines can still perform well on this harder task as long as appropriate image binarization is applied to input photographs. For such binarization, we systematically evaluate the performance of 12 binarization methods as well as of a new binarization algorithm that we propose here. Our evaluation includes different metrics and uses established natural image text recognition benchmarks (ICDAR 2003 and ICDAR 2011). Our main finding is thus the fact that image binarization methods combined with additional filtering of generated connected components and off-the-shelf OCR engines can achieve state-of-the-art performance for end-to-end text understanding in natural images.
Keywords :
image recognition; natural scenes; text analysis; text detection; ICDAR 2003; ICDAR 2011; connected component filtering; end-to-end text understanding; image binarization methods; input photographs; natural image text recognition benchmarks; natural images; off-the-shelf OCR engines; performance evaluation; text detection; text recognition; text scanning; Accuracy; Engines; Image recognition; Image segmentation; Optical character recognition software; Pipelines; Text recognition; natural scene binarization; text localization;
Conference_Titel :
Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
Conference_Location :
Washington, DC
DOI :
10.1109/ICDAR.2013.33