DocumentCode
3570654
Title
Spatial pyramid VLAD
Author
Renhao Zhou ; Qingsheng Yuan ; Xiaoguang Gu ; Dongming Zhang
Author_Institution
Key Lab. of Intell. Inf. Process., Inst. of Comput. Technol., Beijing, China
fYear
2014
Firstpage
342
Lastpage
345
Abstract
In recent years, VLAD has become a popular method which encoding powerful local descriptors to the compact representations. By using this approach, an image can be represented by just a few dozen bytes while preserving excellent retrieval results after the dimensionality reduction and compression. However, throwing away the spatial information is one of the biggest weaknesses of VLAD. This paper adopts the spatial pyramid pooling method to incorporate the spatial information into the VLAD vectors. Furthermore, a new normalization method is proposed to hold this advantage. By the proposed method, the performance of VLAD can be boosted through combining spatial information. The experimental results show that our approach outperforms VLAD in almost all configurations.
Keywords
image coding; image retrieval; dimensionality reduction; image compression; image retrieval; normalization method; spatial pyramid VLAD; vector of locally aggregated descriptors; Computer vision; Conferences; Pattern recognition; Principal component analysis; Vectors; Visualization; Vocabulary; Spatial Pyramid; VLAD; feature representation; image retrieval; normalization;
fLanguage
English
Publisher
ieee
Conference_Titel
Visual Communications and Image Processing Conference, 2014 IEEE
Type
conf
DOI
10.1109/VCIP.2014.7051576
Filename
7051576
Link To Document