Title :
Performance Evaluation of Mathematical Formula Identification
Author :
Lin, Xiaoyan ; Gao, Liangcai ; Tang, Zhi ; Lin, Xiaofan ; Hu, Xuan
Author_Institution :
Inst. of Comput. Sci. & Technol., Peking Univ., Beijing, China
Abstract :
This paper presents a performance evaluation system for mathematical formula identification. First, a ground-truth dataset is constructed to facilitate performance comparison of different mathematical formula identification algorithms. Statistical analysis shows that the dataset is diverse and reflects real-world documents. Second, a performance evaluation metric for mathematical formula identification is proposed, comprising error type definitions and scenario-adjustable scoring. The proposed metric enables in-depth analysis of mathematical formula identification systems in different scenarios. Finally, based on the proposed evaluation metric, a tool is developed to automatically evaluate mathematical formula identification results. The ground-truth dataset and the evaluation tool are freely available for academic purposes.
Keywords :
document handling; mathematics computing; statistical analysis; ground truth dataset; mathematical formula identification; performance evaluation; performance evaluation metric; real-world documents; scenario adjustable scoring; statistics analysis; Educational institutions; Layout; Mathematics; Performance evaluation; Portable document format; Text analysis; Mathematical formula identification; evaluation metric; ground truth; performance evaluation;
Conference_Title :
Document Analysis Systems (DAS), 2012 10th IAPR International Workshop on
Conference_Location :
Gold Coast, QLD
Print_ISBN :
978-1-4673-0868-7
DOI :
10.1109/DAS.2012.68