  • DocumentCode
    1012620
  • Title
    Machine recognition and correction of printed Arabic text
  • Author
    Amin, Adnan; Mari, Jean F.
  • Author_Institution
    Dept. of Math., Kuwait Univ., Kuwait
  • Volume
    19
  • Issue
    5
  • fYear
    1989
  • Firstpage
    1300
  • Lastpage
    1306
  • Abstract
    A method is presented for the automatic recognition of multifont Arabic text entered from a 300-dpi scanner. The system has two components, one for character recognition and one for word recognition. Character recognition is further divided into three phases: digitization, segmentation of words into characters, and identification of characters. The word recognition component is based on the Viterbi algorithm and can handle some identification errors. Character recognition was achieved despite several impeding properties of the Arabic script, especially the connectivity of characters. The processing speed is close to three characters per second, with a 90% recognition rate. All algorithms were written in Pascal and run on an IBM PC/AT.
  • Keywords
    optical character recognition; IBM PC/AT; OCR; Pascal; Viterbi algorithm; character recognition; computerised pattern recognition; digitization process; identification; printed Arabic text; segmentation; Control systems; Control theory; Costs; Design methodology; Differential equations; Hierarchical systems; Interconnected systems; Stability analysis; Text recognition
  • fLanguage
    English
  • Journal_Title
    Systems, Man and Cybernetics, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    0018-9472
  • Type
    jour
  • DOI
    10.1109/21.44052
  • Filename
    44052