DocumentCode
3326944
Title
Design of a mathematical expression recognition system
Author
Lee, Hsi-Jian ; Wang, Jiumn-Shine
Author_Institution
Dept. of Comput. Sci. & Inf. Eng., Nat. Chiao Tung Univ., Hsinchu, Taiwan
Volume
2
fYear
1995
fDate
14-16 Aug 1995
Firstpage
1084
Abstract
We present a system to segment and recognize texts and mathematical expressions in a document. The system can be divided into six stages: page segmentation and labeling, character segmentation, feature extraction, character recognition, expression formation, and error correction and expression extraction. In expression formation, we build a symbol relation tree for each text line to represent the relationships among the symbols in the text line. Some heuristic rules based on the primitive tokens are used to correct the recognition errors in a text line. We extract all mathematical expressions according to some basic expression forms. Our database consists of 190 symbols in the current stage. The average recognition rate is about 96.16%
Keywords
character recognition; document image processing; feature extraction; image segmentation; character segmentation; feature extraction; heuristic rules; labeling; mathematical equations understanding; mathematical expression recognition; mathematical expressions; page segmentation; scientific documents; Character recognition; Equations; Error correction; Graphics; Image edge detection; Image segmentation; Labeling; Layout; Optical character recognition software; Text recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis and Recognition, 1995., Proceedings of the Third International Conference on
Conference_Location
Montreal, Que.
Print_ISBN
0-8186-7128-9
Type
conf
DOI
10.1109/ICDAR.1995.602097
Filename
602097
Link To Document