• DocumentCode
    3695258
  • Title

    Table information extraction and structure recognition using query patterns

  • Author

    T Kasar;T K Bhowmik;A Belaïd

  • Author_Institution
    LORIA - Universite de Lorraine, BP 239 - Loria Campus Scientifique, 54506 Nancy Cedex, FRANCE
  • fYear
    2015
  • Firstpage
    1086
  • Lastpage
    1090
  • Abstract
    In this paper, we present a query-based approach to selectively extract tabular information and recognize the table structure from scanned documents. Unlike conventional table processing paradigms, we adopt a client-driven approach where clients provide a query pattern by specifying a set of key-fields in the document image. The query pattern is first transformed into an attributed relational graph where each node is described with features and the edges with spatial relationships between the nodes. A fast graph matching technique is then used to retrieve other similar graphs from the document image. Further, the extracted graphs are collectively analyzed to deduce the overall tabular structure. Experiments on a dataset of 101 commercial transaction documents demonstrate the effectiveness of the proposed method.
  • Keywords
    "Power capacitors","Pattern recognition"
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition (ICDAR), 2015 13th International Conference on
  • Type

    conf

  • DOI
    10.1109/ICDAR.2015.7333928
  • Filename
    7333928