Title :
Part-Structured Inkball Models for One-Shot Handwritten Word Spotting
Author :
Howe, Nicholas R.
Author_Institution :
Smith Coll., Northampton, MA, USA
Abstract :
Many document collections of historical interest are handwritten and lack transcripts. Scholars need tools for high-quality information retrieval in such environments, preferably without the burden of extensive system training. This paper presents a novel approach to word spotting designed for manuscripts or degraded print that requires minimal initial training. It can infer a generative word appearance model from a single instance, and then use the model to retrieve similar words from arbitrary documents. An approximation to the retrieval statistic runs efficiently on graphics processing hardware. Tested on two standard data sets, the method compares favorably with prior results.
Keywords :
document image processing; graphics processing units; handwritten character recognition; history; image retrieval; learning (artificial intelligence); word processing; arbitrary documents; degraded print; graphic processing hardware; high-quality information retrieval statistic; historic document collection; initial training; manuscripts; one-shot handwritten word spotting; part-structured inkball models; standard data sets; Computational modeling; Deformable models; Handwriting recognition; Ink; Skeleton; Training; Transforms; handwriting recognition; inkball models; one-shot learning; part-structured models; word spotting;
Conference_Titel :
Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
Conference_Location :
Washington, DC
DOI :
10.1109/ICDAR.2013.121