• DocumentCode
    3487645
  • Title

    Novel Sub-character HMM Models for Arabic Text Recognition

  • Author

    Ahmad, Ishtiaq ; Rothacker, Leonard ; Fink, Glenn A. ; Mahmoud, Sabri A.

  • Author_Institution
    Inf. & Comput. Sci, King Fahd Univ. of Pet. & Miner. (KFUPM), Dhahran, Saudi Arabia
  • fYear
    2013
  • fDate
    25-28 Aug. 2013
  • Firstpage
    658
  • Lastpage
    662
  • Abstract
    Hidden Markov Model (HMM) is one of the most widely used classifier for text recognition. In this paper we are presenting novel sub-character HMM models for Arabic text recognition. Modeling at sub-character level allows sharing of common patterns between different contextual forms of Arabic characters as well as between different characters. The number of HMMs gets reduced considerably while still capturing the variations in shape patterns. This results in a compact and efficient recognizer with reduced model set and is expected to be more robust to the imbalance in data distribution. Experimental results using the sub-character model based recognition of handwritten Arabic text as well printed Arabic text are reported.
  • Keywords
    handwritten character recognition; hidden Markov models; natural language processing; text detection; Arabic characters; Arabic text recognition; data distribution; handwritten Arabic text; hidden Markov model; novel sub-character HMM models; reduced model set; Character recognition; Data models; Databases; Handwriting recognition; Hidden Markov models; Shape; Text recognition; Arabic text recognition; Hidden Markov Models; OCR; Parameter sharing; Sub-character HMMs;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
  • Conference_Location
    Washington, DC
  • ISSN
    1520-5363
  • Type

    conf

  • DOI
    10.1109/ICDAR.2013.135
  • Filename
    6628700