Title :
Discovering Image Semantics in Codebook Derivative Space
Author :
Wang, Jinjun ; Gong, Yihong
Author_Institution :
Epson R&D, Inc., San Jose, CA, USA
Abstract :
The sparse coding based approaches for image recognition have recently shown improved performance than traditional bag-of-features technique. Due to high dimensionality of the image descriptor space, existing systems usually require very large codebook size to minimize coding error in order to get satisfactory accuracy. While most research efforts try to address the problem by constructing a relatively smaller codebook with stronger discriminative power, in this paper, we introduce an alternative solution by enhancing the quality of coding. Particularly, we apply the idea similar to Fisher kernel to the coding framework, where we use the image-dependent codebook derivative to represent the image. The proposed idea is generic across multiple coding criteria, and in this paper, it is applied to enhance the locality-constraint linear coding (LLC). Experiments show that, the extracted new feature, called “LLC+,” achieved significantly improved accuracy on several challenging datasets even with a small codebook of 1/20 the reported size used by LLC. This obviously adds to LLC+ the modeling accuracy, processing speed and codebook training advantages.
Keywords :
feature extraction; image coding; image recognition; image representation; linear codes; Fisher kernel; LLC+; codebook derivative space; codebook training advantages; coding error minimization; coding quality enhancement; feature extraction; high image descriptor space dimensionality; image recognition; image representation; image semantics discovery; image-dependent codebook derivative; locality-constraint linear coding; modeling accuracy; multiple coding criteria; performance improvement; processing speed; sparse coding based approach; very large codebook size; Accuracy; Dictionaries; Encoding; Image coding; Image reconstruction; Image representation; Training; Codebook derivative; multiclass image classification; sparse-coding;
Journal_Title :
Multimedia, IEEE Transactions on
DOI :
10.1109/TMM.2012.2186120