Multi-script Text Extraction from Natural Scenes

Author

Gomez, L. ; Karatzas, Dimosthenis

Author_Institution

Comput. Vision Center, Univ. Autonoma de Barcelona, Barcelona, Spain

fYear

2013

fDate

25-28 Aug. 2013

Firstpage

467

Lastpage

471

Abstract

Scene text extraction methodologies are usually based in classification of individual regions or patches, using a priori knowledge for a given script or language. Human perception of text, on the other hand, is based on perceptual organisation through which text emerges as a perceptually significant group of atomic objects. Therefore humans are able to detect text even in languages and scripts never seen before. In this paper, we argue that the text extraction problem could be posed as the detection of meaningful groups of regions. We present a method built around a perceptual organisation framework that exploits collaboration of proximity and similarity laws to create text-group hypotheses. Experiments demonstrate that our algorithm is competitive with state of the art approaches on a standard dataset covering text in variable orientations and two languages.

Keywords

character recognition; feature extraction; multiscript text extraction; perceptual organisation framework; proximity law; scene text extraction methodologies; similarity law; text extraction problem; text perception; text-group hypothesis; Collaboration; Computer vision; Feature extraction; Image edge detection; Organizations; Semantics; Text recognition; Localisation; Perceptual grouping; Scene text; Segmentation;

fLanguage

English

Publisher

ieee

Conference_Titel

Document Analysis and Recognition (ICDAR), 2013 12th International Conference on

Conference_Location

Washington, DC

ISSN

1520-5363

Type

conf

DOI

10.1109/ICDAR.2013.100

Filename

6628665