Title :
Identification of Mathematical Expressions in Document Images
Author_Institution :
Indian Stat. Inst., Kolkata, India
Abstract :
Identification of mathematical expressions in document images is attempted. Several features starting from low level image features, shape features, linguistics information, etc. are extracted and combined to pinpoint the expressions. A performance index has been formulated to evaluate the identification results. Experiment uses a data set of 200 publicly available images containing 1163 embedded and 1039 displayed expressions. Test results show accuracies of 88.3% and 97.2%, respectively for extracting embedded and displayed expressions perfectly.
Keywords :
document image processing; feature extraction; mathematical analysis; document image; feature extraction; linguistic information; mathematical expression; Character recognition; Data mining; Flexible manufacturing systems; Image analysis; Image recognition; Optical character recognition software; Performance analysis; Shape; Testing; Text analysis; Layout analysis; Mathematical Expression; Optical Character Recognition; Performance Evaluation;
Conference_Titel :
Document Analysis and Recognition, 2009. ICDAR '09. 10th International Conference on
Conference_Location :
Barcelona
Print_ISBN :
978-1-4244-4500-4
Electronic_ISBN :
1520-5363
DOI :
10.1109/ICDAR.2009.203