• DocumentCode
    2011828
  • Title
    Word Image Retrieval Using Bag of Visual Words
  • Author
    Shekhar, Ravi ; Jawahar, C.V.
  • Author_Institution
    Center for Visual Inf. Technol., IIIT Hyderabad, Hyderabad, India
  • fYear
    2012
  • fDate
    27-29 March 2012
  • Firstpage
    297
  • Lastpage
    301
  • Abstract
    This paper presents a Bag of Visual Words (BoVW) based approach for retrieving similar word images from a large database efficiently and accurately. We show that a text retrieval system can be adapted to build a word image retrieval solution, which helps in achieving scalability. We demonstrate the method on more than 1 million word images with sub-second retrieval time. We validate the method on four Indian languages and report a mean average precision of more than 0.75. We represent each word image as a histogram of the visual words present in the image. Visual words are quantized representations of local regions; in this work, SIFT descriptors at interest points are used as feature vectors. To address the lack of spatial structure in the BoVW representation, we re-rank the retrieved list, which significantly improves performance.
  • Keywords
    document image processing; feature extraction; image retrieval; linguistics; natural languages; word processing; Indian languages; SIFT descriptors; bag of visual words; feature vectors; local region quantized representation; mean average precision; text retrieval system; visual word histogram; word image retrieval; Feature extraction; Histograms; Indexing; Vectors; Visualization; Vocabulary; Bag of Visual Words; Scalability; Word Image Retrieval;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Document Analysis Systems (DAS), 2012 10th IAPR International Workshop on
  • Conference_Location
    Gold Coast, QLD
  • Print_ISBN
    978-1-4673-0868-7
  • Type
    conf
  • DOI
    10.1109/DAS.2012.96
  • Filename
    6195382
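
The representation described in the abstract (local descriptors quantized into visual words, one histogram per word image, ranking with text-retrieval machinery) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and is not taken from the paper: the descriptors are synthetic vectors standing in for SIFT, the vocabulary is assumed to be given (it would typically come from clustering training descriptors), the TF-IDF weighting with cosine scoring is one common text-retrieval choice, and the function names are hypothetical. The re-ranking step mentioned in the abstract is not covered.

```python
import numpy as np


def quantize(descriptors, vocabulary):
    """Map each local descriptor to the index of its nearest visual word."""
    # Squared Euclidean distance between every descriptor and every word centre.
    d2 = ((descriptors[:, None, :] - vocabulary[None, :, :]) ** 2).sum(axis=2)
    return d2.argmin(axis=1)


def bovw_histogram(descriptors, vocabulary):
    """Count how often each visual word occurs in one word image."""
    words = quantize(descriptors, vocabulary)
    return np.bincount(words, minlength=len(vocabulary)).astype(float)


def tfidf(histograms):
    """Weight counts by a smoothed inverse document frequency, then L2-normalise."""
    n_images = histograms.shape[0]
    df = (histograms > 0).sum(axis=0)  # number of images containing each word
    idf = np.log((1.0 + n_images) / (1.0 + df)) + 1.0
    weighted = histograms * idf
    norms = np.linalg.norm(weighted, axis=1, keepdims=True)
    return weighted / np.maximum(norms, 1e-12), idf


def rank(query_hist, index, idf):
    """Return database positions sorted by decreasing cosine similarity."""
    q = query_hist * idf
    q = q / max(np.linalg.norm(q), 1e-12)
    scores = index @ q
    return np.argsort(-scores), scores


# Synthetic stand-ins: 8 visual words in a 16-dimensional descriptor space.
rng = np.random.default_rng(0)
vocabulary = rng.normal(size=(8, 16))


def fake_descriptors(word_ids):
    """Hypothetical helper: descriptors lying close to the chosen visual words."""
    noise = 0.01 * rng.normal(size=(len(word_ids), 16))
    return vocabulary[word_ids] + noise


# Three database "word images", each built from a different set of visual words.
database = [
    fake_descriptors([0, 0, 1, 2]),
    fake_descriptors([3, 3, 4, 5]),
    fake_descriptors([6, 7, 7, 1]),
]
histograms = np.stack([bovw_histogram(d, vocabulary) for d in database])
index, idf = tfidf(histograms)

# A query that shares its visual words with the second database image.
query = bovw_histogram(fake_descriptors([3, 4, 4, 5]), vocabulary)
order, scores = rank(query, index, idf)
print("best match:", order[0])
```

In a system at the scale the abstract reports, the dense matrix product would normally be replaced by an inverted index over visual words, so that only images sharing at least one word with the query are scored.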