• DocumentCode
    3487594
  • Title

    Document Specific Sparse Coding for Word Retrieval

  • Author

    Shekhar, Ravi ; Jawahar, C.V.

  • Author_Institution
    Centre for Visual Inf. Technol., Int. Inst. of Inf. Technol., Hyderabad, India
  • fYear
    2013
  • fDate
    25-28 Aug. 2013
  • Firstpage
    643
  • Lastpage
    647
  • Abstract
    Bag of words (BoW) based retrieval is an efficient method to compare the visual similarity between two images. Recognition free methods based on BoW have shown to outperform OCR based methods. We further improve the performance by defining a document specific sparse coding scheme for representing visual words (interest points) in document images. Our method is motivated by the successful use of sparsity in signal representation by exploiting the neighbourhood properties. In addition to providing insights into the design of the coding scheme, we also verify the method on two data sets and compare with the recent methods. We have also developed text query based search solution, and we report performance comparable to image based search.
  • Keywords
    document image processing; image coding; image representation; information retrieval; BoW based retrieval; bag of words based retrieval; document images; document specific sparse coding; recognition free methods; signal representation; text query based search solution; visual similarity; Encoding; Feature extraction; Image coding; Quantization (signal); Vectors; Visualization; Vocabulary; Bag of Words; Document Image Retrieval; Sparse Coding;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
  • Conference_Location
    Washington, DC
  • ISSN
    1520-5363
  • Type

    conf

  • DOI
    10.1109/ICDAR.2013.132
  • Filename
    6628697