Title :
Are Characters Objects?
Author :
Diem, Markus ; Sablatnig, Robert
Author_Institution :
Comput. Vision Lab., Vienna Univ. of Technol., Vienna, Austria
Abstract :
This paper presents a character recognition system that handles degraded manuscript documents such as those discovered at St. Catherine's Monastery. In contrast to state-of-the-art OCR systems, no early decision (image binarization) needs to be made. Instead, an object recognition methodology is adapted for the recognition of ancient manuscripts. The proposed system is based on local descriptors, which are clustered in order to localize characters. Finally, a class probability histogram is assigned to each character present in an image, which allows each character to be classified. The system achieves an F0.5 score of 0.77 on real-world data in which 13.5% of the characters are highly degraded.
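The abstract reports an F0.5 score without defining it. Assuming it denotes the standard F-beta measure with beta = 0.5 (an assumption, since the record does not say so), the score is the weighted harmonic mean F_beta = (1 + beta^2) * P * R / (beta^2 * P + R), which weights precision P more heavily than recall R. The sketch below is illustrative only; the function name `f_beta` and the sample precision and recall values are hypothetical and are not taken from the paper.

```python
def f_beta(precision: float, recall: float, beta: float = 0.5) -> float:
    """Weighted harmonic mean of precision and recall.

    beta < 1 emphasizes precision, beta > 1 emphasizes recall,
    and beta == 1 gives the ordinary F1 score.
    """
    if precision < 0.0 or recall < 0.0:
        raise ValueError("precision and recall must be non-negative")
    b2 = beta * beta
    denominator = b2 * precision + recall
    if denominator == 0.0:
        # Both precision and recall are zero: define the score as 0.
        return 0.0
    return (1.0 + b2) * precision * recall / denominator


if __name__ == "__main__":
    # Hypothetical values chosen for illustration, not results from the paper.
    p, r = 0.8, 0.6
    print(f"F0.5 = {f_beta(p, r, beta=0.5):.3f}")
    print(f"F1   = {f_beta(p, r, beta=1.0):.3f}")
```

With these sample values precision exceeds recall, so the F0.5 score comes out higher than the F1 score, which shows the extra weight placed on precision.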
Keywords :
image enhancement; optical character recognition; probability; OCR systems; character classification; character recognition system; object recognition methodology; probability histogram
Conference_Title :
Frontiers in Handwriting Recognition (ICFHR), 2010 International Conference on
Conference_Location :
Kolkata
Print_ISBN :
978-1-4244-8353-2
DOI :
10.1109/ICFHR.2010.93