DocumentCode :
591994
Title :
HMM-based Offline Arabic Handwriting Recognition: Using New Feature Extraction and Lexicon Ranking Techniques
Author :
Eraqi, H.M. ; Abdelazeem, S.
Author_Institution :
Electron. Eng. Dept., American Univ. in Cairo, Cairo, Egypt
fYear :
2012
fDate :
18-20 Sept. 2012
Firstpage :
554
Lastpage :
559
Abstract :
In this paper, a new offline Arabic handwriting recognition system is presented. The Douglas-Peucker algorithm is applied on the skeletonized parts of the offline images to convert it into piecewise linear curves that are used for efficient detection of diacritics, noise segments, and the baseline. A hidden Markov model (HMM)-based system is used with features extracted from the image before and after removing the diacritics. A reliable method of lexicon ranking and reduction based on the information of the image´s diacritics, number of piece of Arabic words (PAWs), and dimensions information is used. The proposed system has been tested using the IFN/ENIT database and has achieved promising recognition rates.
Keywords :
document image processing; feature extraction; handwriting recognition; hidden Markov models; image thinning; natural languages; Douglas-Peucker algorithm; HMM; IFN/ENIT database; PAW; diacritics detection; dimensions information; feature extraction; hidden Markov model; lexicon ranking technique; lexicon reduction; noise segment; offline arabic handwriting recognition; piece of Arabic word; piecewise linear curve; skeletonized part; Feature extraction; Handwriting recognition; Hidden Markov models; Image segmentation; Noise; Speech recognition; Writing; Baseline Detection; Diacritics Detection; Feature Extraction; Handwriting Recognition; Hidden Markov Model; Lexicon Ranking; Lexicon Reduction; Skeletonization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Frontiers in Handwriting Recognition (ICFHR), 2012 International Conference on
Conference_Location :
Bari
Print_ISBN :
978-1-4673-2262-1
Type :
conf
DOI :
10.1109/ICFHR.2012.214
Filename :
6424454
Link To Document :
بازگشت