DocumentCode
2835824
Title
Segmentation-free word recognition with application to Arabic
Author
Al-Badr, Badr ; Haralick, Robert M.
Author_Institution
Intelligent Syst. Lab., Washington Univ., Seattle, WA, USA
Volume
1
fYear
1995
fDate
14-16 Aug 1995
Firstpage
355
Abstract
This paper describes the design and implementation of a system that recognizes machine-printed Arabic words without prior segmentation. The technique is based on describing symbols in terms of shape primitives. At recognition time, the primitives are detected on a word image using mathematical morphology operations. The system then matches the detected primitives with symbol models. This leads to a spatial arrangement of matched symbol models. The system conducts a search in the space of spatial arrangements of models and outputs the arrangement with the highest posterior probability as the recognition of the word. The advantage of using this whole word approach versus a segmentation approach is that the result of recognition is optimized with regard to the whole word. Results of preliminary experiments using a lexicon of 42,000 words show a recognition rate of 99.4% for noise-free text and 73% for scanned text
Keywords
character recognition; image recognition; mathematical morphology; design; implementation; lexicon; machine-printed Arabic words; matched symbol models; mathematical morphology; posterior probability; segmentation-free word recognition; shape primitives; spatial arrangement; spatial arrangements; symbols; word image; Character recognition; Image recognition; Image segmentation; Intelligent systems; Laboratories; Morphology; Noise shaping; Optical character recognition software; Shape; Writing;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis and Recognition, 1995., Proceedings of the Third International Conference on
Conference_Location
Montreal, Que.
Print_ISBN
0-8186-7128-9
Type
conf
DOI
10.1109/ICDAR.1995.599012
Filename
599012
Link To Document