DocumentCode
2199017
Title
Are Characters Objects?
Author
Diem, Markus ; Sablatnig, Robert
Author_Institution
Comput. Vision Lab., Vienna Univ. of Technol., Vienna, Austria
fYear
2010
fDate
16-18 Nov. 2010
Firstpage
565
Lastpage
570
Abstract
This paper presents a character recognition system that handles degraded manuscript documents such as those discovered at St. Catherine's Monastery. In contrast to state-of-the-art OCR systems, no early decision (image binarization) needs to be performed. Thus, an object recognition methodology is adapted for the recognition of ancient manuscripts. The proposed system is based on local descriptors, which are clustered in order to localize characters. Finally, a class probability histogram is assigned to each character present in an image, which allows for character classification. The system achieves an F0.5 score of 0.77 on real-world data that contains 13.5% highly degraded characters.
Keywords
image enhancement; optical character recognition; probability; OCR systems; character classification; character recognition system; object recognition methodology; probability histogram;
fLanguage
English
Publisher
IEEE
Conference_Titel
Frontiers in Handwriting Recognition (ICFHR), 2010 International Conference on
Conference_Location
Kolkata
Print_ISBN
978-1-4244-8353-2
Type
conf
DOI
10.1109/ICFHR.2010.93
Filename
5693623