• DocumentCode
    3222331
  • Title

    On developing high accuracy OCR systems for Telugu and other Indian scripts

  • Author

    Bhagvati, Chakravarthy ; Ravi, Tanuku ; Kumar, S. Mohan ; Negi, Atul

  • Author_Institution
    Dept. of Comput. & Inf. Sci., Hyderabad Univ., India
  • fYear
    2002
  • fDate
    13-15 Dec. 2002
  • Firstpage
    18
  • Lastpage
    23
  • Abstract
    In this paper we list a number of factors that are important in achieving high recognition accuracy in OCR systems for Telugu and other Indian scripts. While it is relatively easy to obtain 85%-93% accuracy, it becomes increasingly difficult to improve the performance further We discuss how the factors presented in this paper helped achieve an accuracy of nearly 97% with our OCR system for Telugu script. It is expected that these factors are specific not only to Telugu but also work for other Indian scripts in general and south Indian scripts in particular.
  • Keywords
    optical character recognition; Hindi; Indian scripts; OCR systems; Optical Character Recognition systems; Telugu; Acoustical engineering; Character recognition; Collaboration; Error correction; Government; Optical character recognition software; Shape; System performance; World Wide Web;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Language Engineering Conference, 2002. Proceedings
  • Print_ISBN
    0-7695-1885-0
  • Type

    conf

  • DOI
    10.1109/LEC.2002.1182286
  • Filename
    1182286