Title :
Fast HMM-Filler Approach for Key Word Spotting in Handwritten Documents
Author :
Toselli, Alejandro Hector ; Vidal, Enrique
Author_Institution :
Univ. Politec. de Valencia, Valencia, Spain
Abstract :
The so-called filler or garbage Hidden Markov Models (HMM) are among the most widely used models for lexicon-free, query by string key word spotting in the fields of speech recognition and (lately) handwritten text recognition. An important drawback of this approach is the large computational cost of the keyword-specific HMM Viterbi decoding process needed to obtain the confidence scores of each word to be spotted. This paper presents a novel way to compute such confidence scores, directly from character lattices produced during a single Viterbi decoding process using only the "filler" model (i.e. no explicit keyword-specific decoding is needed). Experiments show that, as compared with the classical HMM-filler approach, the proposed method obtains essentially the same spotting results, while requiring between one and two orders of magnitude less query computing time.
Keywords :
Viterbi decoding; document image processing; handwriting recognition; hidden Markov models; information retrieval; HMM Viterbi decoding process; character lattices; fast HMM-filler approach; garbage Hidden Markov Models; handwritten documents; handwritten text recognition; query computing time; single Viterbi decoding process; speech recognition; string key word spotting; Computational modeling; Decoding; Handwriting recognition; Hidden Markov models; Indexing; Training; Viterbi algorithm; Character Lattice; HMM-Filler Model; Spotting;
Conference_Titel :
Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
Conference_Location :
Washington, DC
DOI :
10.1109/ICDAR.2013.106