DocumentCode
3486990
Title
Multi-script Text Extraction from Natural Scenes
Author
Gomez, L. ; Karatzas, Dimosthenis
Author_Institution
Comput. Vision Center, Univ. Autonoma de Barcelona, Barcelona, Spain
fYear
2013
fDate
25-28 Aug. 2013
Firstpage
467
Lastpage
471
Abstract
Scene text extraction methodologies are usually based in classification of individual regions or patches, using a priori knowledge for a given script or language. Human perception of text, on the other hand, is based on perceptual organisation through which text emerges as a perceptually significant group of atomic objects. Therefore humans are able to detect text even in languages and scripts never seen before. In this paper, we argue that the text extraction problem could be posed as the detection of meaningful groups of regions. We present a method built around a perceptual organisation framework that exploits collaboration of proximity and similarity laws to create text-group hypotheses. Experiments demonstrate that our algorithm is competitive with state of the art approaches on a standard dataset covering text in variable orientations and two languages.
Keywords
character recognition; feature extraction; multiscript text extraction; perceptual organisation framework; proximity law; scene text extraction methodologies; similarity law; text extraction problem; text perception; text-group hypothesis; Collaboration; Computer vision; Feature extraction; Image edge detection; Organizations; Semantics; Text recognition; Localisation; Perceptual grouping; Scene text; Segmentation;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
Conference_Location
Washington, DC
ISSN
1520-5363
Type
conf
DOI
10.1109/ICDAR.2013.100
Filename
6628665
Link To Document