DocumentCode
3488860
Title
Graphics Extraction from Heterogeneous Online Documents with Hierarchical Random Fields
Author
Delaye, Adrien ; Cheng-Lin Liu
Author_Institution
Nat. Lab. of Pattern Recognition, Inst. of Autom., Beijing, China
fYear
2013
fDate
25-28 Aug. 2013
Firstpage
1007
Lastpage
1011
Abstract
Graphical objects are important elements of freely handwritten notes but their segmentation from the document is challenging due to their irregular properties. This paper introduces an original solution for automatically segmenting diagrams and drawings from unstructured online documents. We propose a multi-scale representation of the document modeled as a hierarchical Conditional Random Field to predict the detection of graphical elements at the stroke level. An experimental evaluation with realistic documents highlights the benefit of the hierarchical model in comparison with a flat Conditional Random Field and demonstrates the robustness of our system.
Keywords
document image processing; feature extraction; image representation; image segmentation; diagram segmentation; document multiscale representation; document segmentation; drawing segmentation; flat conditional random field; graphical element detection; graphical objects; graphics extraction; handwritten notes; heterogeneous online documents; hierarchical conditional random field; unstructured online documents; Conferences; Feature extraction; Graphics; Handwriting recognition; Mathematical model; Text analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
Conference_Location
Washington, DC
ISSN
1520-5363
Type
conf
DOI
10.1109/ICDAR.2013.202
Filename
6628767
Link To Document