DocumentCode :
3487885
Title :
Bringing Semantics in Word Image Retrieval
Author :
Krishnan, Prasad ; Jawahar, C.V.
Author_Institution :
Center for Visual Inf. Technol., IIIT Hyderabad, Hyderabad, India
fYear :
2013
fDate :
25-28 Aug. 2013
Firstpage :
733
Lastpage :
737
Abstract :
Performance of the recognition free approaches for document retrieval, heavily depends on the exact or approximate matching of images (in some feature space) to retrieve documents containing the same word. However, the harder problem in information retrieval is to effectively bring semantics into the retrieval pipeline. This is further challenging when the matching is based on visual features. In this work, we investigate this problem, and suggest a solution by directly transferring the semantics from the textual domain. Our retrieval framework uses (i) the language resources like Word Net and (ii) an annotated corpus of document images, to retrieve semantically relevant words from a large word image database. We demonstrate the method on two languages - English and Hindi, and quantitatively evaluate the performance on annotated word image databases of more than a Million images.
Keywords :
document image processing; image matching; image retrieval; natural language processing; English language; Hindi language; Word Net; annotated corpus; approximate image matching; document images; document retrieval; exact image matching; information retrieval; language resources; large word image database; recognition free approaches; word image retrieval; Equations; Indexing; Mathematical model; Semantics; Visualization; Vocabulary; Bag of Words; Semantic Indexing; Word Image Retrieval;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
Conference_Location :
Washington, DC
ISSN :
1520-5363
Type :
conf
DOI :
10.1109/ICDAR.2013.150
Filename :
6628715
Link To Document :
بازگشت