• DocumentCode
    1994657
  • Title

    Automatic understanding of structures in printed mathematical expressions

  • Author

    Mitra, Joydip ; Garain, Utpal ; Chaudhuri, B.B. ; Kumar Swamy HV ; Pal, Tamaltaru

  • Author_Institution
    Indian Stat. Inst., Kolkata, India
  • fYear
    2003
  • fDate
    3-6 Aug. 2003
  • Firstpage
    540
  • Abstract
    Recognizing mathematical expressions from document image is a key problem in automatic conversion of scientific documents into electronic form. In this paper, we propose a simple grammar-based approach to recognize complex two-dimensional structures of printed mathematical expressions with high accuracy. The proposed technique is based on the structural information of symbols in an expression. An efficient implementation of the grammar is presented. The system generates a TEX string for the input expression. A new criterion for defining structural complexity of a mathematical expression has been formulated to measure the performance of the proposed technique. Experiment using a good representative sample of mathematical expressions shows a reasonably high efficiency of the system.
  • Keywords
    character recognition; document image processing; TEX string; document image; electronic form; grammar-based approach; printed mathematical expression; scientific document conversion; structural complexity; symbol structural information; Text analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition, 2003. Proceedings. Seventh International Conference on
  • Print_ISBN
    0-7695-1960-1
  • Type

    conf

  • DOI
    10.1109/ICDAR.2003.1227723
  • Filename
    1227723