Title :
Indexing and retrieving cursive documents without recognition
Author :
Clavelli, Antonio ; Cordella, Luigi P. ; De Stefano, Claudio ; Marcelli, Angelo
Author_Institution :
DIIIE, Univ. of Salerno, Fisciano, Italy
Abstract :
A large number of handwritten documents exist in image form, as scanned documents. The supporting electronic media allow for better preservation, but to access their content the documents must be processed by some kind of recognition technology that converts the image to searchable text. In the case of cursively written documents, even the best available technology introduces recognition errors that may degrade the performance of a document retrieval system. We propose a recognition-free approach which embodies two main components: a shape matching algorithm, working on the ink, and a string matching algorithm, working on the ink interpretation of a reference set. Experiments on a data set of 16,500 cursive words produced by hundreds of writers show promising results and suggest that the proposed method can be a viable tool for building inexpensive retrieval systems for cursive documents.
Keywords :
document image processing; image matching; image retrieval; cursive document indexing; cursive document retrieval; electronic media; handwritten documents; recognition-free approach; scanned documents; shape matching algorithm; string matching algorithm; Character recognition; Costs; Error correction; Humans; Image converters; Image recognition; Image retrieval; Indexing; Information retrieval; Ink
Conference_Title :
Pattern Recognition, 2008. ICPR 2008. 19th International Conference on
Conference_Location :
Tampa, FL
Print_ISBN :
978-1-4244-2174-9
Electronic_ISBN :
1051-4651
DOI :
10.1109/ICPR.2008.4761833