DocumentCode
183268
Title
Word-Graph and Character-Lattice Combination for KWS in Handwritten Documents
Author
Puigcerver, Joan ; Toselli, Alejandro Hector ; Vidal, Enrique
Author_Institution
PRHLT Res. Center, Univ. Politec. de Valencia, Valencia, Spain
fYear
2014
fDate
1-4 Sept. 2014
Firstpage
181
Lastpage
186
Abstract
We present a handwritten text Keyword Spotting (KWS) approach based on the combination of KWS methods using word-graphs (WGs) and character-lattices (CLs). It aims to solve the problem that WG-based models present for out of vocabulary (OOV) keywords: since there is no available information about them in the lexicon or the language model, null scores are assigned. OOV keywords may have a significant impact on the global performance of KWS systems, as we show. By using a CL approach, which does not suffer from the previous problem, to estimate the OOV scores, we take advantage of both models, using the speed and accuracy that WGs provide for in-vocabulary keywords and the flexibility of the CL approach. This combination improves significantly both average precision and mean average precision over the two methods.
Keywords
handwritten character recognition; text detection; CL approach; KWS methods; OOV keywords; OOV score estimation; WG-based models; character-lattices; global performance; handwritten documents; handwritten text KWS approach; handwritten text keyword spotting approach; in-vocabulary keywords; language model; lexicon model; mean average precision; null-score assignment; out-of-vocabulary keywords; word-graphs; Computational modeling; Decoding; Hidden Markov models; Image segmentation; Training; Viterbi algorithm; Vocabulary; character lattice; handwriting text recognition; information retrieval; out of vocabulary; spotting; word graph;
fLanguage
English
Publisher
ieee
Conference_Titel
Frontiers in Handwriting Recognition (ICFHR), 2014 14th International Conference on
Conference_Location
Heraklion
ISSN
2167-6445
Print_ISBN
978-1-4799-4335-7
Type
conf
DOI
10.1109/ICFHR.2014.38
Filename
6981017
Link To Document