Title :
Learning a compact latent representation of the Bag-of-Parts model
Author :
Xiaozhi Chen ; Huimin Ma
Author_Institution :
Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
Abstract :
The Bag-of-Parts (BoP) model, which employs distinctive parts to represent images, has shown superior performance in vision recognition tasks. Our work is motivated by the need of reducing redundancy in tens of thousands parts. We propose a novel method to learn a compact latent representation from redundant part responses. We address this problem by employing spectral clustering and a multi-column coding scheme. The BoP model is viewed as a multi-scale convolutional model and additional sparse autoencoders are used to infer the latent patterns embedded in high-dimensional part-based representations. Spatial and semantic information is preserved by sparse learning on multiple spatial regions individually. Experiments demonstrate that the learnt representation achieves competitive performance with state-of-the-art methods on PASCAL VOC 2007 dataset.
Keywords :
image coding; image representation; learning (artificial intelligence); object recognition; pattern clustering; BoP; PASCAL VOC 2007 dataset; bag-of-parts model; compact latent representation learning; high-dimensional part-based representations; latent patterns; multicolumn coding scheme; multiscale convolutional model; redundant part responses; sparse autoencoders; spectral clustering; vision recognition tasks; Clustering algorithms; Detectors; Encoding; Redundancy; Support vector machines; Training; Visualization; BoP; mid-level representation; multi-column sparse autoencoders; spectral clustering;
Conference_Titel :
Image Processing (ICIP), 2014 IEEE International Conference on
Conference_Location :
Paris
DOI :
10.1109/ICIP.2014.7026197