DocumentCode :
2011828
Title :
Word Image Retrieval Using Bag of Visual Words
Author :
Shekhar, Ravi ; Jawahar, C.V.
Author_Institution :
Center for Visual Inf. Technol., IIIT Hyderabad, Hyderabad, India
fYear :
2012
fDate :
27-29 March 2012
Firstpage :
297
Lastpage :
301
Abstract :
This paper presents a Bag of Visual Words (BoVW) based approach to retrieve similar word images from a large database, efficiently and accurately. We show that a text retrieval system can be adapted to build a word image retrieval solution, which helps in achieving scalability. We demonstrate the method on more than 1 million word images with a sub-second retrieval time. We validate the method on four Indian languages and report a mean average precision of more than 0.75. We represent each word image as a histogram of the visual words present in the image. Visual words are quantized representations of local regions; for this work, SIFT descriptors at interest points are used as feature vectors. To address the lack of spatial structure in the BoVW representation, we re-rank the retrieved list. This significantly improves the performance.
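The representation described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the descriptors are random toy vectors standing in for SIFT, the vocabulary is given rather than learned by clustering, plain cosine similarity stands in for the text retrieval system, and the spatial re-ranking step is omitted. All function names are hypothetical.

```python
import numpy as np

def quantize(descriptors, vocabulary):
    """Assign each local descriptor to its nearest visual word (centroid index)."""
    # Squared Euclidean distance between every descriptor and every centroid.
    d2 = ((descriptors[:, None, :] - vocabulary[None, :, :]) ** 2).sum(axis=2)
    return d2.argmin(axis=1)

def bovw_histogram(descriptors, vocabulary):
    """L2-normalized histogram of visual-word counts for one word image."""
    words = quantize(descriptors, vocabulary)
    hist = np.bincount(words, minlength=len(vocabulary)).astype(float)
    norm = np.linalg.norm(hist)
    return hist / norm if norm > 0 else hist

def rank(query_hist, database_hists):
    """Return database indices sorted by decreasing cosine similarity."""
    # Histograms are unit length, so the dot product is the cosine similarity.
    scores = database_hists @ query_hist
    return np.argsort(-scores, kind="stable")

# Toy data (assumption): 8 visual words, 16-D descriptors, 5 "word images".
rng = np.random.default_rng(0)
vocabulary = rng.normal(size=(8, 16))
images = [rng.normal(size=(20, 16)) for _ in range(5)]
database = np.stack([bovw_histogram(d, vocabulary) for d in images])

# Query with image 2 and obtain a ranked list of database indices.
order = rank(database[2], database)
```

Because each histogram is sparse over a large vocabulary in practice, the same scoring can be served from an inverted index, which is what makes a text retrieval system a natural fit.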
Keywords :
document image processing; feature extraction; image retrieval; linguistics; natural languages; word processing; Indian languages; SIFT descriptors; bag of visual words; feature vectors; local region quantized representation; mean average precision; text retrieval system; visual word histogram; word image retrieval; Feature extraction; Histograms; Indexing; Vectors; Visualization; Vocabulary; Bag of Visual Words; Scalability; Word Image Retrieval;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Document Analysis Systems (DAS), 2012 10th IAPR International Workshop on
Conference_Location :
Gold Coast, QLD
Print_ISBN :
978-1-4673-0868-7
Type :
conf
DOI :
10.1109/DAS.2012.96
Filename :
6195382