• DocumentCode
    2198654
  • Title
    Ontology-Based Information Extraction from Handwritten Documents
  • Author
    Ebert, Sebastian ; Liwicki, Marcus ; Dengel, Andreas
  • Author_Institution
    German Research Center for Artificial Intelligence, Kaiserslautern, Germany
  • fYear
    2010
  • fDate
    16-18 Nov. 2010
  • Firstpage
    483
  • Lastpage
    488
  • Abstract
    In this paper we introduce a new layer for the task of handwriting recognition. We add semantic information by means of ontologies. The task of our recognizer therefore is not only to recognize the ASCII transcription of the handwritten document, but also to identify the semantic concepts which appear in the text. This task is called ontology-based information extraction (OBIE), which has been applied to electronic documents recently. OBIE methods first segment the text into tokens, then identify their values and their corresponding instances of the ontology, and finally try to generate new facts based on the text. To the authors' knowledge, in this paper OBIE is proposed for the first time in handwriting literature. In our experiments we have evaluated the process up to the instantiation. We have found that using not only the top alternative, but also the k-best alternatives increases the performance of information extraction. Furthermore, the use of an ontology-based lexicon results in another performance increase.
  • Keywords
    document handling; handwriting recognition; ontologies (artificial intelligence); ASCII transcription; OBIE methods; handwritten documents; ontology-based information extraction;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Frontiers in Handwriting Recognition (ICFHR), 2010 International Conference on
  • Conference_Location
    Kolkata
  • Print_ISBN
    978-1-4244-8353-2
  • Type
    conf
  • DOI
    10.1109/ICFHR.2010.82
  • Filename
    5693610