Title :
Feature Design for Offline Arabic Handwriting Recognition: Handcrafted vs Automated?
Author :
Chherawala, Youssouf ; Roy, Partha Pratim ; Cheriet, Mohamed
Author_Institution :
Ecole de Technol. Super., Synchromedia Lab., Montreal, QC, Canada
Abstract :
In handwriting recognition, design of relevant feature is a very important but daunting task. On one hand, handcraft design of features is difficult, depending on expert knowledge and on heuristics. On the other hand, biologically inspired neural networks are able to learn automatically features from the input image, but requires a good underlying model. The goal of this paper is to evaluate the performance of automatically learned features compared to handcrafted features, as they provide a promising alternative to the difficult task of features handcrafting. In this work, the recognition model is based on the long short-term memory (LSTM) and connectionist temporal classification (CTC) neural networks. This model has been shown to outperform the well-known HMM model for various handwriting tasks, thanks to its reliable probabilistic modeling. In its multidimensional form, called MDLSTM, this network is able to automatically learn features from the input image. For evaluation, we compare the MDLSTM learned features and four state-of-the-art handcrafted features. The IFN/ENIT database has been used as benchmark for Arabic word recognition, where the results are promising.
Keywords :
feature extraction; handwritten character recognition; hidden Markov models; image classification; learning (artificial intelligence); natural language processing; neural nets; probability; visual databases; Arabic word recognition; CTC neural networks; HMM model; IFN-ENIT database; LSTM; MDLSTM learned features; automatically learned features; biologically inspired neural networks; connectionist temporal classification neural networks; feature design; handcraft design; long short-term memory; offline Arabic handwriting recognition model; performance evaluation; probabilistic modeling; Feature extraction; Handwriting recognition; Hidden Markov models; Image recognition; Recurrent neural networks; Training;
Conference_Titel :
Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
Conference_Location :
Washington, DC
DOI :
10.1109/ICDAR.2013.65