Title of article :
Towards a more discriminative and semantic visual vocabulary
Author/Authors :
Lَpez-Sastre، نويسنده , , R.J. and Tuytelaars، نويسنده , , T. and Acevedo-Rodrيguez، نويسنده , , F.J. and Maldonado-Bascَn، نويسنده , , S.، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2011
Pages :
11
From page :
415
To page :
425
Abstract :
We present a novel method for constructing a visual vocabulary that takes into account the class labels of images, thus resulting in better recognition performance and more efficient learning. Our method consists of two stages: Cluster Precision Maximisation (CPM) and Adaptive Refinement. In the first stage, a Reciprocal Nearest Neighbours (RNN) clustering algorithm is guided towards class representative visual words by maximising a new cluster precision criterion. As we are able to optimise the vocabulary without the need for expensive cross-validation, the overall training time is significantly reduced without a negative impact on the results. Next, an adaptive threshold refinement scheme is proposed with the aim of increasing vocabulary compactness while at the same time improving the recognition rate and further increasing the representativeness of the visual words for category-level object recognition. This is a correlation clustering based approach, which works as a meta-clustering and optimises the cut-off threshold for each cluster separately. In the experiments we analyse the recognition rate of different vocabularies for a subset of the Caltech 101 dataset, showing how RNN in combination with CPM selects the optimal codebooks, and how the clustering refinement step succeeds in further increasing the recognition rate.
Keywords :
Category-level object recognition , Correlation clustering , Visual vocabulary , Reciprocal Nearest Neighbours , Bag-of-Words , Cluster precision
Journal title :
Computer Vision and Image Understanding
Serial Year :
2011
Journal title :
Computer Vision and Image Understanding
Record number :
1696192
Link To Document :
بازگشت