DocumentCode
2271393
Title
Using local features for efficient layout analysis of ancient manuscripts
Author
Garz, Angelika ; Sablatnig, Robert ; Diem, Markus
Author_Institution
Comput. Vision Lab., Vienna Univ. of Technol., Vienna, Austria
fYear
2011
fDate
Aug. 29 2011-Sept. 2 2011
Firstpage
1259
Lastpage
1263
Abstract
A binarization-free layout analysis method for ancient manuscripts is proposed, which identifies and localizes layout entities exploiting their structural similarities on the local level. Hence, the textual entities are disassembled into segments, and a part-based detection is done which employs local gradient features known from the field of object recognition, the Scale Invariant Feature Transform (SIFT), to describe these structures. Layout analysis is the first step in the process of document understanding; it identifies regions of interest and, hence, serves as input for other algorithms such as Optical Character Recognition (OCR). Moreover, the document layout allows scholars to establish the spatio-temporal origin, authenticate, or index a document. The layout entities considered in this approach include the body text, embellished initials, plain initials and headings.
Keywords
document image processing; history; image segmentation; transforms; SIFT; ancient manuscripts layout analysis; binarization-free layout analysis method; document body text; document embellished initials; document headings; document layout analysis; document plain initials; document understanding; local gradient features; page segmentation; part-based detection; regions of interest identification; scale invariant feature transform; structural similarities; textual entities; Feature extraction; Image color analysis; Image segmentation; Ink; Layout; Shape; Writing;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing Conference, 2011 19th European
Conference_Location
Barcelona
ISSN
2076-1465
Type
conf
Filename
7074177
Link To Document