DocumentCode :
2832203
Title :
Automatic segmentation for Arabic characters in handwriting documents
Author :
Lawgali, A. ; Bouridane, A. ; Angelova, M. ; Ghassemlooy, Z.
Author_Institution :
Sch. of Comput., Eng. & Inf. Sci., Northumbria Univ., Newcastle upon Tyne, UK
fYear :
2011
fDate :
11-14 Sept. 2011
Firstpage :
3529
Lastpage :
3532
Abstract :
The cursive, ligature-rich nature of Arabic script makes segmenting words into individual characters a difficult task. Methods developed for cursive Latin and other scripts have been applied to Arabic script, but they are generally insufficient for segmenting Arabic text. This paper proposes a new segmentation algorithm for handwritten Arabic text. The main idea is to segment each word into sub-words and then compute the baseline of each sub-word. Using the descenders of the sub-words and the baseline, candidate segmentation points are then calculated from a vertical projection. The algorithm was tested on 800 handwritten Arabic words taken from the IFN/ENIT database and compared against some existing methods, with promising results.
Keywords :
handwritten character recognition; image segmentation; natural language processing; visual databases; word processing; Arabic script; IFN/ENIT database; cursive Latin scripts; handwritten Arabic text; handwritten Arabic words; segmentation algorithm; subword segmentation; Conferences; Databases; Handwriting recognition; Image segmentation; Noise; Shape; Skeleton; Arabic character segmentation
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Image Processing (ICIP), 2011 18th IEEE International Conference on
Conference_Location :
Brussels
ISSN :
1522-4880
Print_ISBN :
978-1-4577-1304-0
Electronic_ISBN :
1522-4880
Type :
conf
DOI :
10.1109/ICIP.2011.6116476
Filename :
6116476