DocumentCode :
2489769
Title :
Indexing and retrieving cursive documents without recognition
Author :
Clavelli, Antonio ; Cordella, Luigi P. ; De Stefano, Claudio ; Marcelli, Angelo
Author_Institution :
DIIIE, Univ. of Salerno, Fisciano, Italy
fYear :
2008
fDate :
8-11 Dec. 2008
Firstpage :
1
Lastpage :
4
Abstract :
A large amount of handwritten documents exist in image form, as scanned documents. The supporting electronic media allows for better preservation, but to access their content they must be processed by some kind of recognition technologies that convert the image to searchable text. In case of cursively written documents, even the best available technology introduces recognition errors that may drive down the performance of a document retrieval system. We propose a recognition-free approach which embodies two main components: a shape matching algorithm, working on the ink, and a string matching algorithm working on the ink interpretation of a reference set. Experiments on a data set of 16,500 cursive words produced by hundreds of writers show promising results and suggest that the proposed method can be a viable tool to build inexpensive retrieval system for cursive documents.
Keywords :
document image processing; image matching; image retrieval; cursive document indexing; cursive document retrieval; electronic media; handwritten documents; recognition-free approach; scanned documents; shape matching algorithm; string matching algorithm; Character recognition; Costs; Error correction; Humans; Image converters; Image recognition; Image retrieval; Indexing; Information retrieval; Ink;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition, 2008. ICPR 2008. 19th International Conference on
Conference_Location :
Tampa, FL
ISSN :
1051-4651
Print_ISBN :
978-1-4244-2174-9
Electronic_ISBN :
1051-4651
Type :
conf
DOI :
10.1109/ICPR.2008.4761833
Filename :
4761833
Link To Document :
بازگشت