• DocumentCode
    3488860
  • Title

    Graphics Extraction from Heterogeneous Online Documents with Hierarchical Random Fields

  • Author

    Delaye, Adrien ; Cheng-Lin Liu

  • Author_Institution
    Nat. Lab. of Pattern Recognition, Inst. of Autom., Beijing, China
  • fYear
    2013
  • fDate
    25-28 Aug. 2013
  • Firstpage
    1007
  • Lastpage
    1011
  • Abstract
    Graphical objects are important elements of freely handwritten notes but their segmentation from the document is challenging due to their irregular properties. This paper introduces an original solution for automatically segmenting diagrams and drawings from unstructured online documents. We propose a multi-scale representation of the document modeled as a hierarchical Conditional Random Field to predict the detection of graphical elements at the stroke level. An experimental evaluation with realistic documents highlights the benefit of the hierarchical model in comparison with a flat Conditional Random Field and demonstrates the robustness of our system.
  • Keywords
    document image processing; feature extraction; image representation; image segmentation; diagram segmentation; document multiscale representation; document segmentation; drawing segmentation; flat conditional random field; graphical element detection; graphical objects; graphics extraction; handwritten notes; heterogeneous online documents; hierarchical conditional random field; unstructured online documents; Conferences; Feature extraction; Graphics; Handwriting recognition; Mathematical model; Text analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
  • Conference_Location
    Washington, DC
  • ISSN
    1520-5363
  • Type

    conf

  • DOI
    10.1109/ICDAR.2013.202
  • Filename
    6628767