Title :
Scene Text Localization and Recognition with Oriented Stroke Detection
Author :
Neumann, Luka ; Matas, Jose
Author_Institution :
Dept. of Cybern., Czech Tech. Univ., Prague, Czech Republic
Abstract :
An unconstrained end-to-end text localization and recognition method is presented. The method introduces a novel approach for character detection and recognition which combines the advantages of sliding-window and connected component methods. Characters are detected and recognized as image regions which contain strokes of specific orientations in a specific relative position, where the strokes are efficiently detected by convolving the image gradient field with a set of oriented bar filters. Additionally, a novel character representation efficiently calculated from the values obtained in the stroke detection phase is introduced. The representation is robust to shift at the stroke level, which makes it less sensitive to intra-class variations and the noise induced by normalizing character size and positioning. The effectiveness of the representation is demonstrated by the results achieved in the classification of real-world characters using an euclidian nearest-neighbor classifier trained on synthetic data in a plain form. The method was evaluated on a standard dataset, where it achieves state-of-the-art results in both text localization and recognition.
Keywords :
character recognition; image representation; text detection; Euclidian nearest-neighbor classifier; character detection; character recognition; character representation; character size normalization; connected component method; image gradient field; image regions; intraclass variations; oriented bar filter set; oriented stroke detection; positioning normalization; scene text localization; scene text recognition; sliding-window method; specific relative position; standard dataset; stroke detection efficiency; stroke detection phase; stroke level; unconstrained end-to-end text localization method; unconstrained end-to-end text recognition method; Character recognition; Noise; Optical character recognition software; Robustness; Text recognition; Training; Vectors; nature scene text; text localization; text recognition; text-in-the-wild;
Conference_Titel :
Computer Vision (ICCV), 2013 IEEE International Conference on
Conference_Location :
Sydney, NSW
DOI :
10.1109/ICCV.2013.19