DocumentCode
1638310
Title
Segmentation-free Word Spotting in Historical Printed Documents
Author
Gatos, B. ; Pratikakis, I.
Author_Institution
Comput. Intell. Lab., Nat. Res. Center Demokritos, Athens, Greece
fYear
2009
Firstpage
271
Lastpage
275
Abstract
In this paper, a new efficient word spotting methodology is presented that can be applied to historical printed documents without requiring any previous block or word segmentation step. Our aim is to address a methodology which is segmentation-free since in many cases of historical documents, the segmentation process does not produce meaningful results due to unconstraint layout, several degradations or typesetting imperfections. The proposed method is based on block-based document image descriptors that are used at a template matching process satisfying invariance in terms of translation, rotation and scaling. Improvement in terms of time expense is obtained by applying the matching process only on salient regions of the image. Experimental results on a database with representative historical printed documents prove the efficiency of the proposed approach.
Keywords
document image processing; history; image matching; block-based document image descriptor; historical printed document image; image rotation; image scaling; image translation; segmentation-free word spotting; template matching process; typesetting imperfection; unconstraint layout; Computational intelligence; Degradation; Image databases; Image retrieval; Image segmentation; Informatics; Laboratories; Optical character recognition software; Text analysis; Typesetting; Historical Documents; Segmentation-free analysis; Word Spotting;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis and Recognition, 2009. ICDAR '09. 10th International Conference on
Conference_Location
Barcelona
ISSN
1520-5363
Print_ISBN
978-1-4244-4500-4
Electronic_ISBN
1520-5363
Type
conf
DOI
10.1109/ICDAR.2009.236
Filename
5277703
Link To Document