Title :
ExpressMatch: A System for Creating Ground-Truthed Datasets of Online Mathematical Expressions
Author :
Aguilar, Frank D J ; Hirata, Nina S T
Author_Institution :
Dept. of Comput. Sci., Univ. of Sao Paulo, Sao Paulo, Brazil
Abstract :
In recognition domains, publicly available ground-truthed datasets are essential to perform effective performance evaluation and comparison of existing methods and systems. However, in the field of online handwritten mathematical expression recognition, datasets are quite scarce and their creation is one of the current challenging issues. In this paper, we present Express Match, a system designed to help creation and management of online mathematical expression datasets with ground-truth data. In this system, handwritten model expressions can be input and manually annotated with ground-truth data, transcriptions of these expressions can be automatically annotated by matching them to the respective models. Additional metadata can also be attached to each sample expression. To test the system, a dataset consisting of 56 model expressions and 910 sample expressions with a total of 20,010 symbols, written by 25 different writers, has been created. This dataset, as well as Express Match, will be made publicly available.
Keywords :
handwriting recognition; mathematics computing; ExpressMatch; ground truthed datasets; online handwritten mathematical expression recognition; performance comparison; performance evaluation; Computational modeling; Data models; Handwriting recognition; Integrated circuit modeling; Labeling; Performance evaluation; Writing; ground-truthed dataset; online mathematical expressions; performance evaluation;
Conference_Titel :
Document Analysis Systems (DAS), 2012 10th IAPR International Workshop on
Conference_Location :
Gold Cost, QLD
Print_ISBN :
978-1-4673-0868-7
DOI :
10.1109/DAS.2012.38