DocumentCode
3487594
Title
Document Specific Sparse Coding for Word Retrieval
Author
Shekhar, Ravi ; Jawahar, C.V.
Author_Institution
Centre for Visual Inf. Technol., Int. Inst. of Inf. Technol., Hyderabad, India
fYear
2013
fDate
25-28 Aug. 2013
Firstpage
643
Lastpage
647
Abstract
Bag of words (BoW) based retrieval is an efficient method for comparing the visual similarity between two images. Recognition free methods based on BoW have been shown to outperform OCR based methods. We further improve the performance by defining a document specific sparse coding scheme for representing visual words (interest points) in document images. Our method is motivated by the successful use of sparsity in signal representation by exploiting neighbourhood properties. In addition to providing insights into the design of the coding scheme, we verify the method on two data sets and compare it with recent methods. We have also developed a text query based search solution, and we report performance comparable to image based search.
Keywords
document image processing; image coding; image representation; information retrieval; BoW based retrieval; bag of words based retrieval; document images; document specific sparse coding; recognition free methods; signal representation; text query based search solution; visual similarity; Encoding; Feature extraction; Image coding; Quantization (signal); Vectors; Visualization; Vocabulary; Bag of Words; Document Image Retrieval; Sparse Coding;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
Conference_Location
Washington, DC
ISSN
1520-5363
Type
conf
DOI
10.1109/ICDAR.2013.132
Filename
6628697