DocumentCode
3695233
Title
Crossing the lines: making optimal use of context in line-based Handwritten Text Recognition
Author
J. Tanha;J.D. Does;K. Depuydt;J.A. Sánchez
Author_Institution
Institute for Dutch Lexicology (INL), Matthias de Vrieshof 3, 2311 BZ Leiden, The Netherlands
fYear
2015
Firstpage
956
Lastpage
960
Abstract
Hand-written text recognition (HTR) is often carried out line-by-line: the decoding of text lines is carried out independently. This approach is known to deteriorate recognition accuracy of words and characters close to the line boundaries. The present study investigates this issue from the point of view of the language modeling component of the HTR system. Obviously, lack of linguistic context may be one of the reasons for loss of accuracy, but it certainly is not the only factor in play. We seek to clarify to which extent the problem can be influenced by the language modeling component of the system. We first discuss how to develop adapted language models which significantly improve HTR performance in general. We then focus on the deployment of methods to improve accuracy at line boundaries. The final result is an efficient approach which significantly improves HTR accuracy without changing the basic HTR system setup.
Keywords
"Hidden Markov models","Adaptation models","Accuracy","Image recognition","Chlorine"
Publisher
ieee
Conference_Titel
Document Analysis and Recognition (ICDAR), 2015 13th International Conference on
Type
conf
DOI
10.1109/ICDAR.2015.7333903
Filename
7333903
Link To Document