• DocumentCode
    1638310
  • Title

    Segmentation-free Word Spotting in Historical Printed Documents

  • Author

    Gatos, B.; Pratikakis, I.

  • Author_Institution
    Comput. Intell. Lab., Nat. Res. Center Demokritos, Athens, Greece
  • fYear
    2009
  • Firstpage
    271
  • Lastpage
    275
  • Abstract
    In this paper, a new efficient word spotting methodology is presented that can be applied to historical printed documents without requiring any prior block or word segmentation step. Our aim is to propose a segmentation-free methodology, since for many historical documents the segmentation process does not produce meaningful results due to unconstrained layout, various degradations, or typesetting imperfections. The proposed method is based on block-based document image descriptors that are used in a template matching process that is invariant to translation, rotation and scaling. Processing time is reduced by applying the matching process only to salient regions of the image. Experimental results on a database of representative historical printed documents demonstrate the efficiency of the proposed approach.
  • Keywords
    document image processing; history; image matching; block-based document image descriptor; historical printed document image; image rotation; image scaling; image translation; segmentation-free word spotting; template matching process; typesetting imperfection; unconstraint layout; Computational intelligence; Degradation; Image databases; Image retrieval; Image segmentation; Informatics; Laboratories; Optical character recognition software; Text analysis; Typesetting; Historical Documents; Segmentation-free analysis; Word Spotting;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Document Analysis and Recognition, 2009. ICDAR '09. 10th International Conference on
  • Conference_Location
    Barcelona
  • ISSN
    1520-5363
  • Print_ISBN
    978-1-4244-4500-4
  • Electronic_ISBN
    1520-5363
  • Type

    conf

  • DOI
    10.1109/ICDAR.2009.236
  • Filename
    5277703