DocumentCode
1994657
Title
Automatic understanding of structures in printed mathematical expressions
Author
Mitra, Joydip ; Garain, Utpal ; Chaudhuri, B.B. ; Kumar Swamy HV ; Pal, Tamaltaru
Author_Institution
Indian Stat. Inst., Kolkata, India
fYear
2003
fDate
3-6 Aug. 2003
Firstpage
540
Abstract
Recognizing mathematical expressions in document images is a key problem in the automatic conversion of scientific documents into electronic form. In this paper, we propose a simple grammar-based approach to recognize complex two-dimensional structures of printed mathematical expressions with high accuracy. The proposed technique is based on the structural information of symbols in an expression. An efficient implementation of the grammar is presented. The system generates a TeX string for the input expression. A new criterion for defining the structural complexity of a mathematical expression has been formulated to measure the performance of the proposed technique. Experiments using a representative sample of mathematical expressions show a reasonably high efficiency of the system.
Keywords
character recognition; document image processing; TeX string; document image; electronic form; grammar-based approach; printed mathematical expression; scientific document conversion; structural complexity; symbol structural information; text analysis
fLanguage
English
Publisher
IEEE
Conference_Titel
Document Analysis and Recognition, 2003. Proceedings. Seventh International Conference on
Print_ISBN
0-7695-1960-1
Type
conf
DOI
10.1109/ICDAR.2003.1227723
Filename
1227723