DocumentCode
389314
Title
Typeset mathematical expression analysis
Author
Jin, Jian-Ming ; Han, Zhi ; Wang, Qing-Ren
Author_Institution
Inst. of Machine Intelligence, Nankai Univ., Tianjin, China
Volume
2
fYear
2002
fDate
2002
Firstpage
1038
Abstract
Many mathematical expressions can be found in scientific papers, but no OCR system can recognize a scanned expression. A typeset mathematical expression analysis method, with the main idea to decompose an expression into a serial of sub-expressions and to determine the relations among these sub-expressions, is presented in this paper. The decomposition process, which is hierarchical and recursive, is illustrated by the decomposition tree. Eleven relations are defined, and the actual relationship among the sub-expressions is determined in 5 steps. In order to reduce the complexity of the original expression, mathematical glyphs are divided into 3 levels. Experimental results show that this method is good for analyzing a variety of complex expressions.
Keywords
character recognition; document image processing; edge detection; trees (mathematics); complexity; decomposition tree; document image processing; mathematical glyphs; multiple line expression detection; typeset mathematical expression; Document image processing; Equations; Image analysis; Image recognition; Image segmentation; MATLAB; Machine intelligence; Optical character recognition software; Text recognition; Typesetting;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Cybernetics, 2002. Proceedings. 2002 International Conference on
Print_ISBN
0-7803-7508-4
Type
conf
DOI
10.1109/ICMLC.2002.1174541
Filename
1174541
Link To Document