• DocumentCode
    3326944
  • Title

    Design of a mathematical expression recognition system

  • Author

    Lee, Hsi-Jian ; Wang, Jiumn-Shine

  • Author_Institution
    Dept. of Comput. Sci. & Inf. Eng., Nat. Chiao Tung Univ., Hsinchu, Taiwan
  • Volume
    2
  • fYear
    1995
  • fDate
    14-16 Aug 1995
  • Firstpage
    1084
  • Abstract
    We present a system to segment and recognize texts and mathematical expressions in a document. The system can be divided into six stages: page segmentation and labeling, character segmentation, feature extraction, character recognition, expression formation, and error correction and expression extraction. In expression formation, we build a symbol relation tree for each text line to represent the relationships among the symbols in the text line. Some heuristic rules based on the primitive tokens are used to correct the recognition errors in a text line. We extract all mathematical expressions according to some basic expression forms. Our database consists of 190 symbols in the current stage. The average recognition rate is about 96.16%
  • Keywords
    character recognition; document image processing; feature extraction; image segmentation; character segmentation; feature extraction; heuristic rules; labeling; mathematical equations understanding; mathematical expression recognition; mathematical expressions; page segmentation; scientific documents; Character recognition; Equations; Error correction; Graphics; Image edge detection; Image segmentation; Labeling; Layout; Optical character recognition software; Text recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition, 1995., Proceedings of the Third International Conference on
  • Conference_Location
    Montreal, Que.
  • Print_ISBN
    0-8186-7128-9
  • Type

    conf

  • DOI
    10.1109/ICDAR.1995.602097
  • Filename
    602097