DocumentCode :
3487730
Title :
High-Performance OCR for Printed English and Fraktur Using LSTM Networks
Author :
Breuel, Thomas M. ; Ul-Hasan, Adnan ; Al-Azawi, Mayce Ali ; Shafait, Faisal
Author_Institution :
Tech. Univ. of Kaiserslautern, Kaiserslautern, Germany
fYear :
2013
fDate :
25-28 Aug. 2013
Firstpage :
683
Lastpage :
687
Abstract :
Long Short-Term Memory (LSTM) networks have yielded excellent results on handwriting recognition. This paper describes an application of bidirectional LSTM networks to the problem of machine-printed Latin and Fraktur recognition. Latin and Fraktur recognition differs significantly from handwriting recognition in both the statistical properties of the data, as well as in the required, much higher levels of accuracy. Applications of LSTM networks to handwriting recognition use two-dimensional recurrent networks, since the exact position and baseline of handwritten characters is variable. In contrast, for printed OCR, we used a one-dimensional recurrent network combined with a novel algorithm for baseline and x-height normalization. A number of databases were used for training and testing, including the UW3 database, artificially generated and degraded Fraktur text and scanned pages from a book digitization project. The LSTM architecture achieved 0.6% character-level test-set error on English text. When the artificially degraded Fraktur data set is divided into training and test sets, the system achieves an error rate of 1.64%. On specific books printed in Fraktur (not part of the training set), the system achieves error rates of 0.15% (Fontane) and 1.47% (Ersch-Gruber). These recognition accuracies were found without using any language modelling or any other post-processing techniques.
Keywords :
handwriting recognition; natural language processing; optical character recognition; statistical analysis; text analysis; English text; Fraktur; Fraktur text; LSTM networks; UW3 database; book digitization project; handwriting recognition; handwritten characters; high-performance OCR; long short term memory networks; machine printed Fraktur recognition; machine printed Latin recognition; printed English; printed OCR; recurrent networks; scanned pages; statistical properties; Error analysis; Handwriting recognition; Hidden Markov models; Optical character recognition software; Recurrent neural networks; Training; LSTM Networks; OCR; RNN;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
Conference_Location :
Washington, DC
ISSN :
1520-5363
Type :
conf
DOI :
10.1109/ICDAR.2013.140
Filename :
6628705
Link To Document :
بازگشت