Title :
OCD Dolores - Recovering Logical Structures for Dummies
Author :
Bloechle, Jean-Luc ; Rigamonti, Maurizio ; Ingold, Rolf
Author_Institution :
Dept. of Comput. Sci., Univ. of Fribourg, Fribourg, Switzerland
Abstract :
This paper presents OCD Dolores, an environment that aims at recovering the logical structures from documents by interactively inferring their models. Dolores is based on OCD, an XML canonical document format used to represent structured electronic content efficiently. The relevance of our restructuring system is assessed through a deep evaluation of Dolores´ logical labeling capacities.
Keywords :
XML; document image processing; inference mechanisms; optical character recognition; Dolore logical labeling capacities; OCD Dolores; XML canonical document format; document logical restructuring; document models; dummies; interactive inference; logical structure recovery; structured electronic content represent; Electronic publishing; Feature extraction; Labeling; Learning systems; Text analysis; Text recognition; Training; Dolores; OCD; XED; document model; document structures; interactive learning; learning system; logical structure;
Conference_Titel :
Document Analysis Systems (DAS), 2012 10th IAPR International Workshop on
Conference_Location :
Gold Cost, QLD
Print_ISBN :
978-1-4673-0868-7
DOI :
10.1109/DAS.2012.58