DocumentCode :
177522
Title :
EM-Based Layout Analysis Method for Structured Documents
Author :
Cruz, F. ; Ramos Terrades, O.
Author_Institution :
Comput. Vision Center, Univ. Autonoma de Barcelona, Barcelona, Spain
fYear :
2014
fDate :
24-28 Aug. 2014
Firstpage :
315
Lastpage :
320
Abstract :
In this paper we present a method for layout analysis of structured documents. We propose an EM-based algorithm that fits a set of Gaussian mixtures to the different regions according to their logical distribution along the page. After convergence, we estimate the final shape of each region from the parameters computed for the corresponding component of the mixture. We evaluate the method on the task of record detection in a collection of historical structured documents and compare it with previous work on this task.
Keywords :
Gaussian processes; convergence; document image processing; expectation-maximisation algorithm; EM-based algorithm; Gaussian mixtures; document analysis; historical structured documents; layout analysis; logical distribution; record detection; Computational modeling; Covariance matrices; Image segmentation; Layout; Semantics; Text analysis; Training
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Pattern Recognition (ICPR), 2014 22nd International Conference on
Conference_Location :
Stockholm
ISSN :
1051-4651
Type :
conf
DOI :
10.1109/ICPR.2014.63
Filename :
6976774