• DocumentCode
    2835824
  • Title
    Segmentation-free word recognition with application to Arabic
  • Author
    Al-Badr, Badr ; Haralick, Robert M.
  • Author_Institution
    Intelligent Syst. Lab., Univ. of Washington, Seattle, WA, USA
  • Volume
    1
  • fYear
    1995
  • fDate
    14-16 Aug 1995
  • Firstpage
    355
  • Abstract
    This paper describes the design and implementation of a system that recognizes machine-printed Arabic words without prior segmentation. The technique is based on describing symbols in terms of shape primitives. At recognition time, the primitives are detected on a word image using mathematical morphology operations. The system then matches the detected primitives with symbol models, which yields a spatial arrangement of matched symbol models. The system searches the space of spatial arrangements of models and outputs the arrangement with the highest posterior probability as the recognition of the word. The advantage of this whole-word approach over a segmentation approach is that the recognition result is optimized with regard to the whole word. Results of preliminary experiments using a lexicon of 42,000 words show a recognition rate of 99.4% for noise-free text and 73% for scanned text.
  • Keywords
    character recognition; image recognition; mathematical morphology; design; implementation; lexicon; machine-printed Arabic words; matched symbol models; posterior probability; segmentation-free word recognition; shape primitives; spatial arrangement; spatial arrangements; symbols; word image; Image segmentation; Intelligent systems; Laboratories; Morphology; Noise shaping; Optical character recognition software; Shape; Writing
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Document Analysis and Recognition, 1995., Proceedings of the Third International Conference on
  • Conference_Location
    Montreal, Que.
  • Print_ISBN
    0-8186-7128-9
  • Type
    conf
  • DOI
    10.1109/ICDAR.1995.599012
  • Filename
    599012
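
The abstract's first recognition step, detecting shape primitives on a word image with mathematical morphology, can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not name the operators or primitives used, so the sketch assumes plain binary erosion, and the names `erode`, `VERTICAL_STROKE`, `HORIZONTAL_STROKE`, and `WORD_IMAGE` are hypothetical.

```python
def erode(image, selem):
    """Binary erosion on a 0/1 image given as a list of rows.

    Returns the (row, col) positions of the structuring element's
    top-left corner at which every foreground pixel of `selem`
    lies on a foreground pixel of `image`.
    """
    rows, cols = len(image), len(image[0])
    s_rows, s_cols = len(selem), len(selem[0])
    # Offsets of the foreground pixels inside the structuring element.
    offsets = [(r, c) for r in range(s_rows) for c in range(s_cols) if selem[r][c]]
    hits = []
    for r in range(rows - s_rows + 1):
        for c in range(cols - s_cols + 1):
            if all(image[r + dr][c + dc] for dr, dc in offsets):
                hits.append((r, c))
    return hits


# Hypothetical primitives (assumed for illustration only).
VERTICAL_STROKE = [[1], [1], [1]]   # three pixels tall
HORIZONTAL_STROKE = [[1, 1, 1]]     # three pixels wide

# Toy "word image": two vertical strokes joined by a baseline.
WORD_IMAGE = [
    [0, 1, 0, 0, 0],
    [0, 1, 0, 0, 1],
    [0, 1, 0, 0, 1],
    [1, 1, 1, 1, 1],
]

vertical_hits = erode(WORD_IMAGE, VERTICAL_STROKE)
horizontal_hits = erode(WORD_IMAGE, HORIZONTAL_STROKE)
print(vertical_hits)    # [(0, 1), (1, 1), (1, 4)]
print(horizontal_hits)  # [(3, 0), (3, 1), (3, 2)]
```

In a system like the one described, detections of this kind would presumably feed the later stages (matching against symbol models and searching over spatial arrangements), which the sketch does not attempt to reproduce.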