DocumentCode
3487097
Title
Improving HMM-Based Keyword Spotting with Character Language Models
Author
Fischer, Anath ; Frinken, Volkmar ; Bunke, Horst ; Suen, Ching
Author_Institution
CENPARMI, Concordia Univ., Montreal, QC, Canada
fYear
2013
fDate
25-28 Aug. 2013
Firstpage
506
Lastpage
510
Abstract
Facing high error rates and slow recognition speed for full text transcription of unconstrained handwriting images, keyword spotting is a promising alternative to locate specific search terms within scanned document images. We have previously proposed a learning-based method for keyword spotting using character hidden Markov models that showed a high performance when compared with traditional template image matching. In the lexicon-free approach pursued, only the text appearance was taken into account for recognition. In this paper, we integrate character n-gram language models into the spotting system in order to provide an additional language context. On the modern IAM database as well as the historical George Washington database, we demonstrate that character language models significantly improve the spotting performance.
Keywords
document image processing; handwriting recognition; hidden Markov models; image matching; learning (artificial intelligence); HMM-based keyword spotting; IAM database; character hidden Markov models; character language models; character n-gram language models; full text transcription; historical George Washington database; learning-based method; lexicon-free approach; scanned document images; template image matching; text appearance; unconstrained handwriting images; Character recognition; Databases; Handwriting recognition; Hidden Markov models; Mathematical model; Text recognition; Viterbi algorithm; handwriting recognition; hidden Markov models; keyword spotting; language models;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
Conference_Location
Washington, DC
ISSN
1520-5363
Type
conf
DOI
10.1109/ICDAR.2013.107
Filename
6628672
Link To Document