Title :
Handwritten Word Spotting with Corrected Attributes
Author :
Almazan, Jon ; Gordo, Albert ; Fornes, Alicia ; Valveny, Ernest
Author_Institution :
Comput. Vision Center, Univ. Autònoma de Barcelona, Barcelona, Spain
Abstract :
We propose an approach to multi-writer word spotting, where the goal is to find a query word in a dataset of document images. The approach is attributes-based and yields a low-dimensional, fixed-length representation of word images that is fast to compute and, especially, fast to compare. It naturally leads to a unified representation of word images and strings, so that query-by-example, where the query is an image, and query-by-string, where the query is a string, can be performed interchangeably. We also propose a calibration scheme, based on Canonical Correlation Analysis, that corrects the attribute scores and greatly improves the results on a challenging dataset. We test our approach on two public datasets and show state-of-the-art results.
Keywords :
document image processing; query processing; attributes-based approach; calibration scheme; canonical correlation analysis; corrected attributes; document images; handwritten word spotting; multiwriter word spotting; query-by-string; word images fixed-length representation; Calibration; Computational modeling; Correlation; Hidden Markov models; Histograms; Training; Writing; attributes; cca; multi-writer; word spotting;
Conference_Title :
Computer Vision (ICCV), 2013 IEEE International Conference on
Conference_Location :
Sydney, NSW
DOI :
10.1109/ICCV.2013.130