DocumentCode :
3489027
Title :
Offline Printed Urdu Nastaleeq Script Recognition with Bidirectional LSTM Networks
Author :
Ul-Hasan, Adnan ; Bin Ahmed, Saad ; Rashid, Faisal ; Shafait, Faisal ; Breuel, Thomas M.
Author_Institution :
Dept. of Comput. Sci., Tech. Univ. of Kaiserslautern, Kaiserslautern, Germany
fYear :
2013
fDate :
25-28 Aug. 2013
Firstpage :
1061
Lastpage :
1065
Abstract :
Recurrent neural networks (RNN) have been successfully applied for recognition of cursive handwritten documents, both in English and Arabic scripts. Ability of RNNs to model context in sequence data like speech and text makes them a suitable candidate to develop OCR systems for printed Nabataean scripts (including Nastaleeq for which no OCR system is available to date). In this work, we have presented the results of applying RNN to printed Urdu text in Nastaleeq script. Bidirectional Long Short Term Memory (BLSTM) architecture with Connectionist Temporal Classification (CTC) output layer was employed to recognize printed Urdu text. We evaluated BLSTM networks for two cases: one ignoring the character´s shape variations and the second is considering them. The recognition error rate at character level for first case is 5.15% and for the second is 13.6%. These results were obtained on synthetically generated UPTI dataset containing artificially degraded images to reflect some real-world scanning artifacts along with clean images. Comparison with shape-matching based method is also presented.
Keywords :
image classification; image matching; optical character recognition; recurrent neural nets; Arabic scripts; BLSTM networks; CTC; English scripts; OCR systems; RNN; bidirectional LSTM networks; bidirectional long short term memory; connectionist temporal classification; cursive handwritten documents; offline printed Urdu Nastaleeq script recognition; real-world scanning artifacts; recurrent neural networks; shape-matching; Error analysis; Feature extraction; Handwriting recognition; Optical character recognition software; Recurrent neural networks; Shape; Training; BLSTM Networks; RNN; Urdu OCR;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
Conference_Location :
Washington, DC
ISSN :
1520-5363
Type :
conf
DOI :
10.1109/ICDAR.2013.212
Filename :
6628777
Link To Document :
بازگشت