DocumentCode
3485740
Title
Image Binarization for End-to-End Text Understanding in Natural Images
Author
Milyaev, Sergey ; Barinova, Olga ; Novikova, Tatiana ; Kohli, Pushmeet ; Lempitsky, Victor
Author_Institution
Lomonosov Moscow State Univ., Moscow, Russia
fYear
2013
fDate
25-28 Aug. 2013
Firstpage
128
Lastpage
132
Abstract
While modern off-the-shelf OCR engines show particularly high accuracy on scanned text, text detection and recognition in natural images still remains a challenging problem. Here, we demonstrate that OCR engines can still perform well on this harder task as long as appropriate image binarization is applied to input photographs. For such binarization, we systematically evaluate the performance of 12 binarization methods as well as of a new binarization algorithm that we propose here. Our evaluation includes different metrics and uses established natural image text recognition benchmarks (ICDAR 2003 and ICDAR 2011). Our main finding is thus the fact that image binarization methods combined with additional filtering of generated connected components and off-the-shelf OCR engines can achieve state-of-the-art performance for end-to-end text understanding in natural images.
Keywords
image recognition; natural scenes; text analysis; text detection; ICDAR 2003; ICDAR 2011; connected component filtering; end-to-end text understanding; image binarization methods; input photographs; natural image text recognition benchmarks; natural images; off-the-shelf OCR engines; performance evaluation; text detection; text recognition; text scanning; Accuracy; Engines; Image recognition; Image segmentation; Optical character recognition software; Pipelines; Text recognition; natural scene binarization; text localization;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
Conference_Location
Washington, DC
ISSN
1520-5363
Type
conf
DOI
10.1109/ICDAR.2013.33
Filename
6628598
Link To Document