DocumentCode :
389314
Title :
Typeset mathematical expression analysis
Author :
Jin, Jian-Ming ; Han, Zhi ; Wang, Qing-Ren
Author_Institution :
Inst. of Machine Intelligence, Nankai Univ., Tianjin, China
Volume :
2
fYear :
2002
fDate :
2002
Firstpage :
1038
Abstract :
Many mathematical expressions can be found in scientific papers, but no OCR system can recognize a scanned expression. This paper presents a method for analyzing typeset mathematical expressions whose main idea is to decompose an expression into a series of sub-expressions and to determine the relations among them. The decomposition process, which is hierarchical and recursive, is illustrated by a decomposition tree. Eleven relations are defined, and the actual relationship among the sub-expressions is determined in five steps. To reduce the complexity of the original expression, mathematical glyphs are divided into three levels. Experimental results show that the method works well for analyzing a variety of complex expressions.
Keywords :
character recognition; document image processing; edge detection; trees (mathematics); complexity; decomposition tree; document image processing; mathematical glyphs; multiple line expression detection; typeset mathematical expression; Document image processing; Equations; Image analysis; Image recognition; Image segmentation; MATLAB; Machine intelligence; Optical character recognition software; Text recognition; Typesetting;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Machine Learning and Cybernetics, 2002. Proceedings. 2002 International Conference on
Print_ISBN :
0-7803-7508-4
Type :
conf
DOI :
10.1109/ICMLC.2002.1174541
Filename :
1174541