• DocumentCode
    2199017
  • Title

    Are Characters Objects?

  • Author

    Diem, Markus ; Sablatnig, Robert

  • Author_Institution
    Comput. Vision Lab., Vienna Univ. of Technol., Vienna, Austria
  • fYear
    2010
  • fDate
    16-18 Nov. 2010
  • Firstpage
    565
  • Lastpage
    570
  • Abstract
    This paper presents a character recognition system that handles degraded manuscript documents like the ones discovered at the St. Catherine´s Monastery. In contrast to state-of-the-art OCR systems, no early decision (image binarization) needs to be performed. Thus, an object recognition methodology is adapted for the recognition of ancient manuscripts. The proposed system is based on local descriptors which are clustered in order to localize characters. Finally, a class probability histogram is assigned to each character present in an image which allows for the character classification. The system achieves an F0.5 score of 0.77 on real world data that contains 13.5% highly degraded characters.
  • Keywords
    image enhancement; optical character recognition; probability; OCR systems; character classification; character recognition system; object recognition methodology; probability histogram;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Frontiers in Handwriting Recognition (ICFHR), 2010 International Conference on
  • Conference_Location
    Kolkata
  • Print_ISBN
    978-1-4244-8353-2
  • Type

    conf

  • DOI
    10.1109/ICFHR.2010.93
  • Filename
    5693623