Title :
Improvements in BBN´s HMM-Based Offline Arabic Handwriting Recognition System
Author :
Saleem, Shirin ; Cao, Huaigu ; Subramanian, Krishna ; Kamali, Matin ; Prasad, Rohit ; Natarajan, Prem
Author_Institution :
BBN Technol., Cambridge, MA, USA
Abstract :
Offline handwriting recognition of free-flowing Arabic text is a challenging task due to the plethora of factors that contribute to the variability in the data. In this paper, we address some of these sources of variability, and present experimental results on a large corpus of handwritten documents. Specific techniques such as the application of context-dependent Hidden Markov Models (HMMs) for the cursive Arabic script, unsupervised adaptation to account for the stylistic variations across scribes, and image pre-processing to remove ruled-lines are explored. In particular, we proposed a novel integration of structural features in the HMM framework which exclusively results in a 9% relative improvement in performance. Overall, we demonstrate a relative reduction of 17% in word error rate over our baseline Arabic handwriting recognition system.
Keywords :
data analysis; handwriting recognition; hidden Markov models; image recognition; text analysis; free-flowing Arabic text; handwritten document; hidden Markov model; image pre-processing; offline Arabic handwriting recognition system; Handwriting recognition; Hidden Markov models; Image segmentation; Optical character recognition software; Shape; Testing; Text analysis; Text recognition; Vocabulary; Writing; Arabic; Hidden Markov Models; Offline Handwriting Recognition; Structural Features; Writer Adaptation;
Conference_Titel :
Document Analysis and Recognition, 2009. ICDAR '09. 10th International Conference on
Conference_Location :
Barcelona
Print_ISBN :
978-1-4244-4500-4
Electronic_ISBN :
1520-5363
DOI :
10.1109/ICDAR.2009.282