• DocumentCode
    2870947
  • Title
    A syntactic approach for processing mathematical expressions in printed documents
  • Author
    Garain, U.; Chaudhuri, B.B.
  • Author_Institution
    Comput. Vision & Pattern Recognition Unit, Indian Statistical Inst., Calcutta, India
  • Volume
    4
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    523
  • Abstract
    We propose an approach for understanding mathematical expressions in printed documents. The approach is divided into three main steps: (i) detection of mathematical expressions in a document, (ii) recognition of the symbols present in each expression, and (iii) arrangement of the recognized symbols. Expressions are detected by recognizing a few of the most common symbols and exploiting some structural features of the expressions. A hybrid of feature-based and template-based techniques is used for symbol recognition. A two-pass approach is used for arranging the symbols. The first pass (scanning, or lexical analysis) performs a micro-level examination of the symbols to identify the symbol groups occurring in the expression and to determine their categories, or descriptors. The second pass (parsing, or syntax analysis) processes the descriptors synthesized in the first pass to determine the syntactic structure of the expression. A set of predefined rules guides the activities in both passes. Experiments conducted using this approach on a large number of documents show high accuracy.
  • Keywords
    document image processing; grammars; optical character recognition; feature based technique; lexical analysis; mathematical expressions; micro-level symbol examination; parsing; printed documents; scanning; symbol groups; syntactic approach; syntax analysis; template-based technique; two-pass approach; Books; Computer vision; Document handling; Equations; Optical character recognition software; Pattern recognition; Performance analysis; Testing; White spaces;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Pattern Recognition, 2000. Proceedings. 15th International Conference on
  • Conference_Location
    Barcelona
  • ISSN
    1051-4651
  • Print_ISBN
    0-7695-0750-6
  • Type
    conf
  • DOI
    10.1109/ICPR.2000.902972
  • Filename
    902972
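
The two-pass arrangement step described in the abstract can be illustrated with a minimal sketch. This is not the authors' rule set, which the record does not give. The `Symbol` fields, the descriptors `BASE`/`SUP`/`SUB`, the thresholds `ratio` and `shift`, and the LaTeX-like output are all assumptions chosen for illustration. Pass 1 (scanning) assigns each recognized symbol a descriptor from its bounding-box geometry; pass 2 (parsing) combines the descriptors into a structured string.

```python
from dataclasses import dataclass


@dataclass
class Symbol:
    char: str   # label produced by a symbol recognizer (assumed given)
    x: float    # left edge of the bounding box
    y: float    # vertical centre, image coordinates (larger y is lower)
    h: float    # bounding-box height


def scan(symbols, ratio=0.8, shift=0.25):
    """Pass 1 (hypothetical rules): tag each symbol as BASE, SUP or SUB.

    A symbol is treated as a script when it is clearly smaller than the
    most recent base symbol and vertically displaced from its centre.
    """
    tokens = []
    base = None
    for s in sorted(symbols, key=lambda sym: sym.x):
        kind = "BASE"
        if base is not None and s.h < ratio * base.h:
            dy = s.y - base.y
            if dy < -shift * base.h:
                kind = "SUP"
            elif dy > shift * base.h:
                kind = "SUB"
        if kind == "BASE":
            base = s
        tokens.append((kind, s.char))
    return tokens


def parse(tokens):
    """Pass 2 (hypothetical rules): fold descriptors into a LaTeX-like string."""
    out = []
    i = 0
    while i < len(tokens):
        kind, ch = tokens[i]
        if kind == "BASE":
            out.append(ch)
            i += 1
            continue
        # Group a run of consecutive symbols that share the same script kind.
        run = ""
        while i < len(tokens) and tokens[i][0] == kind:
            run += tokens[i][1]
            i += 1
        out.append(("^" if kind == "SUP" else "_") + "{" + run + "}")
    return "".join(out)


demo = [
    Symbol("x", 0, 50, 10),
    Symbol("2", 8, 43, 6),
    Symbol("+", 16, 50, 6),
    Symbol("y", 26, 50, 10),
    Symbol("i", 34, 56, 5),
]
print(parse(scan(demo)))  # x^{2}+y_{i}
```

The sketch handles only a single level of superscripts and subscripts on one text line. Fractions, radicals, nested scripts, and small operators acting as the reference base would need additional rules in both passes.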