DocumentCode :
1588904
Title :
On automatic similarity linking in digital libraries
Author :
Myka, A. ; Güntzer, U.
Author_Institution :
Wilhelm-Schickard-Inst., Tubingen Univ., Germany
fYear :
1997
Firstpage :
278
Lastpage :
283
Abstract :
Hypertext links are a powerful extension of standard information retrieval techniques based on query languages. However the generation of links is often impractical due to large manual and/or computational effort. We analyze the effects of two main approaches that aim at a restriction of the necessary efforts: the direct use of OCR-processed documents instead of manually post-processed, i.e. corrected documents; and the use of shorter excerpts of documents instead of complete documents. For our tests, similarity links were computed based on the vector-space model; the links that are generated based on unmodified OCR documents and excerpts of documents are then compared to those links that are generated based on complete documents without OCR errors
Keywords :
document image processing; full-text databases; hypermedia; information retrieval; library automation; optical character recognition; query languages; OCR errors; OCR processed documents; automatic similarity linking; complete documents; computational effort; digital libraries; hypertext links; information retrieval techniques; query languages; unmodified OCR documents; vector-space model; Database languages; Information retrieval; Joining processes; Navigation; Optical character recognition software; Software libraries; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Database and Expert Systems Applications, 1997. Proceedings., Eighth International Workshop on
Conference_Location :
Toulouse
Print_ISBN :
0-8186-8147-0
Type :
conf
DOI :
10.1109/DEXA.1997.617294
Filename :
617294
Link To Document :
بازگشت