DocumentCode
2011828
Title
Word Image Retrieval Using Bag of Visual Words
Author
Shekhar, Ravi ; Jawahar, C.V.
Author_Institution
Center for Visual Inf. Technol., IIIT Hyderabad, Hyderabad, India
fYear
2012
fDate
27-29 March 2012
Firstpage
297
Lastpage
301
Abstract
This paper presents a Bag of Visual Words (BoVW) based approach to retrieve similar word images from a large database, efficiently and accurately. We show that a text retrieval system can be adapted to build a word image retrieval solution. This helps in achieving scalability. We demonstrate the method on more than 1 Million word images with a sub-second retrieval time. We validate the method on four Indian languages, and report a mean average precision of more than 0.75. We represent the word images as histogram of visual words present in the image. Visual words are quantized representation of local regions, and for this work, SIFT descriptors at interest points are used as feature vectors. To address the lack of spatial structure in the BoVW representation, we re-rank the retrieved list. This significantly improves the performance.
Keywords
document image processing; feature extraction; image retrieval; linguistics; natural languages; word processing; Indian languages; SIFT descriptors; bag of visual words; feature vectors; local region quantized representation; mean average precision; text retrieval system; visual word histogram; word image retrieval; Feature extraction; Histograms; Indexing; Vectors; Visualization; Vocabulary; Bag of Visual Words; Scalability; Word Image Retrieval;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis Systems (DAS), 2012 10th IAPR International Workshop on
Conference_Location
Gold Cost, QLD
Print_ISBN
978-1-4673-0868-7
Type
conf
DOI
10.1109/DAS.2012.96
Filename
6195382
Link To Document