DocumentCode
2799274
Title
Segmentation of Printed Farsi/Arabic Words
Author
Broumandnia, A. ; Shanbehzadeh, J. ; Nourani, M.
Author_Institution
lslamic Azad Univ.-Tehran South Branch, Tehran
fYear
2007
fDate
13-16 May 2007
Firstpage
761
Lastpage
766
Abstract
Characters connectivity is a problem in automated printed Farsi/Arabic script recognition. This paper introduces a novel scheme based on wavelet transform to solve segmentation of printed Farsi/Arabic words into characters. Our novel algorithm employs a new wavelet transform by which the extracted wavelet coefficients are exploited, in detecting, underlying horizontal edges and base line. Projection of horizontal edges and their location on base line provide the segmentation points. A classification method distinguishes true segmenting points. New algorithm is robust against noise, gray level, font and size of characters. Simulation results provide a comparison between new algorithm and three schemes, closed contour, structural and holistic, in terms of precision, speed and robustness against Gaussian noise. Experimental Results indicate superiority of our scheme in terms of precision and show that new algorithm improves recognition speed by a factor of at least 2.5 times.
Keywords
edge detection; image classification; image segmentation; natural language processing; text analysis; wavelet transforms; classification method; printed Arabic script recognition; printed Arabic word segmentation; printed Farsi script recognition; printed Farsi word segmentation; wavelet coefficient extraction; wavelet transform; Background noise; Character recognition; Discrete wavelet transforms; Image edge detection; Image segmentation; Noise level; Noise robustness; Wavelet coefficients; Wavelet domain; Wavelet transforms; Image Processing; Machine Vision; OCR; Pattern Recognition; Wavelet Transform;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Systems and Applications, 2007. AICCSA '07. IEEE/ACS International Conference on
Conference_Location
Amman
Print_ISBN
1-4244-1030-4
Electronic_ISBN
1-4244-1031-2
Type
conf
DOI
10.1109/AICCSA.2007.370718
Filename
4231046
Link To Document