Title :
Visual Words Assignment Via Information-Theoretic Manifold Embedding
Author :
Yue Deng ; Yipeng Li ; Yanjun Qian ; Xiangyang Ji ; Qionghai Dai
Author_Institution :
Dept. of Autom., Tsinghua Univ., Beijing, China
Abstract :
Codebook-based learning provides a flexible way to extract the contents of an image in a data-driven manner for visual recognition. One central task in such frameworks is codeword assignment, which allocates local image descriptors to the most similar codewords in the dictionary to generate histogram for categorization. Nevertheless, existing assignment approaches, e.g., nearest neighbors strategy (hard assignment) and Gaussian similarity (soft assignment), suffer from two problems: 1) too strong Euclidean assumption and 2) neglecting the label information of the local descriptors. To address the aforementioned two challenges, we propose a graph assignment method with maximal mutual information (GAMI) regularization. GAMI takes the power of manifold structure to better reveal the relationship of massive number of local features by nonlinear graph metric. Meanwhile, the mutual information of descriptor-label pairs is ultimately optimized in the embedding space for the sake of enhancing the discriminant property of the selected codewords. According to such objective, two optimization models, i.e., inexact-GAMI and exact-GAMI, are respectively proposed in this paper. The inexact model can be efficiently solved with a closed-from solution. The stricter exact-GAMI nonparametrically estimates the entropy of descriptor-label pairs in the embedding space and thus leads to a relatively complicated but still trackable optimization. The effectiveness of GAMI models are verified on both the public and our own datasets.
Keywords :
computer vision; feature extraction; graph theory; image recognition; learning (artificial intelligence); Euclidean assumption; GAMI regularization; Gaussian similarity; closed-from solution; codebook-based learning; codeword assignment; descriptor-label pairs; embedding space; exact-GAMI model; graph assignment method with maximal mutual information; image categorization; image contents extraction; image descriptors; inexact-GAMI model; information-theoretic manifold embedding; nearest neighbors strategy; nonlinear graph metric; visual recognition; visual words assignment; Entropy; Histograms; Manifolds; Measurement; Mutual information; Optimization; Visualization; Manifold embedding; mutual information; scene categorization; visual words assignment;
Journal_Title :
Cybernetics, IEEE Transactions on
DOI :
10.1109/TCYB.2014.2300192