• DocumentCode
    3486990
  • Title

    Multi-script Text Extraction from Natural Scenes

  • Author

    Gomez, L. ; Karatzas, Dimosthenis

  • Author_Institution
    Comput. Vision Center, Univ. Autonoma de Barcelona, Barcelona, Spain
  • fYear
    2013
  • fDate
    25-28 Aug. 2013
  • Firstpage
    467
  • Lastpage
    471
  • Abstract
    Scene text extraction methodologies are usually based in classification of individual regions or patches, using a priori knowledge for a given script or language. Human perception of text, on the other hand, is based on perceptual organisation through which text emerges as a perceptually significant group of atomic objects. Therefore humans are able to detect text even in languages and scripts never seen before. In this paper, we argue that the text extraction problem could be posed as the detection of meaningful groups of regions. We present a method built around a perceptual organisation framework that exploits collaboration of proximity and similarity laws to create text-group hypotheses. Experiments demonstrate that our algorithm is competitive with state of the art approaches on a standard dataset covering text in variable orientations and two languages.
  • Keywords
    character recognition; feature extraction; multiscript text extraction; perceptual organisation framework; proximity law; scene text extraction methodologies; similarity law; text extraction problem; text perception; text-group hypothesis; Collaboration; Computer vision; Feature extraction; Image edge detection; Organizations; Semantics; Text recognition; Localisation; Perceptual grouping; Scene text; Segmentation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
  • Conference_Location
    Washington, DC
  • ISSN
    1520-5363
  • Type

    conf

  • DOI
    10.1109/ICDAR.2013.100
  • Filename
    6628665