Title :
Historical Document Layout Analysis Competition
Author :
Antonacopoulos, A. ; Clausner, C. ; Papadopoulos, C. ; Pletschacher, S.
Author_Institution :
Pattern Recognition & Image Anal. (PRImA) Res. Lab., Univ. of Salford, Salford, UK
Abstract :
This paper presents an objective comparative evaluation of layout analysis methods for scanned historical documents. It describes the competition (modus operandi, dataset and evaluation methodology) held in the context of ICDAR2011 and the International Workshop on Historical Document Imaging and Processing (HIP2011), presenting the results of the evaluation of four submitted methods. A commercial state-of-the-art system is also evaluated for comparison. Two scenarios are reported in this paper, one evaluating the ability of methods to accurately segment regions and the other evaluating the whole pipeline of segmentation and region classification (with a text extraction goal). The results indicate that there is a convergence to a certain methodology with some variations in the approach. However, there is still a considerable need to develop robust methods that deal with the idiosyncrasies of historical documents.
Keywords :
document image processing; history; image classification; image segmentation; ICDAR2011; International Workshop on Historical Document Imaging and Processing; historical document layout analysis competition; objective comparative evaluation; page segmentation; region classification; scanned historical document; text extraction goal; Corporate acquisitions; Image segmentation; Layout; Libraries; Particle separators; Performance evaluation; Text analysis; datasets; historical documents; layout analysis; performance evaluation
Conference_Title :
Document Analysis and Recognition (ICDAR), 2011 International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4577-1350-7
Electronic_ISBN :
1520-5363
DOI :
10.1109/ICDAR.2011.301