Title :
m-SNE: Multiview Stochastic Neighbor Embedding
Author :
Xie, Bo ; Mu, Yang ; Tao, Dacheng ; Huang, Kaiqi
Author_Institution :
Dept. of Electron. Eng., Chinese Univ. of Hong Kong, Hong Kong, China
Abstract :
Dimension reduction has been widely used in real-world applications such as image retrieval and document classification. In many scenarios, different features (or multiview data) can be obtained, and how to duly utilize them is a challenge. It is not appropriate for the conventional concatenating strategy to arrange features of different views into a long vector. That is because each view has its specific statistical property and physical interpretation. Even worse, the performance of the concatenating strategy will deteriorate if some views are corrupted by noise. In this paper, we propose a multiview stochastic neighbor embedding (m-SNE) that systematically integrates heterogeneous features into a unified representation for subsequent processing based on a probabilistic framework. Compared with conventional strategies, our approach can automatically learn a combination coefficient for each view adapted to its contribution to the data embedding. This combination coefficient plays an important role in utilizing the complementary information in multiview data. Also, our algorithm for learning the combination coefficient converges at a rate of O(1/k2), which is the optimal rate for smooth problems. Experiments on synthetic and real data sets suggest the effectiveness and robustness of m-SNE for data visualization, image retrieval, object categorization, and scene recognition.
Keywords :
computational complexity; data visualisation; document image processing; image retrieval; object recognition; probability; stochastic processes; combination coefficient; complementary information; concatenating strategy; data embedding; data visualization; dimension reduction; document classification; heterogeneous feature; image retrieval; m-SNE; multiview data; multiview stochastic neighbor embedding; object categorization; optimal rate; probabilistic framework; real data set; real world application; scene recognition; statistical property; synthetic data set; Algorithm design and analysis; Image color analysis; Image retrieval; Noise; Optimization; Probabilistic logic; Probability distribution; Dimension reduction; image retrieval; multiview learning; stochastic neighbor embedding;
Journal_Title :
Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on
DOI :
10.1109/TSMCB.2011.2106208