DocumentCode :
2630229
Title :
Understanding mathematical expressions in a printed document
Author :
Lee, Hsi-Jian ; Lee, Min-Chou
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. Chiao Tung Univ., Hsinchu, Taiwan
fYear :
1993
fDate :
20-22 Oct 1993
Firstpage :
502
Lastpage :
505
Abstract :
A system for understanding mathematical expressions is presented. After separating all symbols in an input mathematical expression, we utilize thirteen features to represent each symbol. In order to reduce the computational time, a coarse classification algorithm is used to reduce the number of candidates. Then for each input symbol, the character with the highest similarity is selected as the candidate symbol. Since some of the symbols in an arithmetical expression may touch each other, a dynamic programming algorithm is adopted to identify correct characters from connected symbols. In the expression formation stage, a procedure-oriented method is proposed for translating the recognized symbols appearing in a 2D space into a 1D character string. The authors have used 105 mathematical expressions as training data and 50 expressions as testing data. The experimental results have demonstrated the feasibility of the understanding system
Keywords :
character recognition; document handling; dynamic programming; 1D character string; 2D space; arithmetical expression; candidate symbol; coarse classification algorithm; connected symbols; dynamic programming algorithm; expression formation stage; highest similarity; mathematical expressions; printed document; procedure-oriented method; symbol recognition; understanding system; Character recognition; Classification algorithms; Computer science; Feature extraction; Heuristic algorithms; Keyboards; Optical character recognition software; Speech recognition; Testing; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition, 1993., Proceedings of the Second International Conference on
Conference_Location :
Tsukuba Science City
Print_ISBN :
0-8186-4960-7
Type :
conf
DOI :
10.1109/ICDAR.1993.395686
Filename :
395686
Link To Document :
بازگشت