DocumentCode :
3603258
Title :
Multi-View Concept Learning for Data Representation
Author :
Ziyu Guan ; Lijun Zhang ; Jinye Peng ; Jianping Fan
Author_Institution :
Coll. of Inf. & Technol., Northwest Univ. of China, Xian, China
Volume :
27
Issue :
11
fYear :
2015
Firstpage :
3016
Lastpage :
3028
Abstract :
Real-world datasets often involve multiple views of data items, e.g., a Web page can be described by both its content and anchor texts of hyperlinks leading to it; photos in Flickr could be characterized by visual features, as well as user contributed tags. Different views provide information complementary to each other. Synthesizing multi-view features can lead to a comprehensive description of the data items, which could benefit many data analytic applications. Unfortunately, the simple idea of concatenating different feature vectors ignores statistical properties of each view and usually incurs the “curse of dimensionality” problem. We propose Multi-view Concept Learning (MCL), a novel nonnegative latent representation learning algorithm for capturing conceptual factors from multi-view data. MCL exploits both multi-view information and label information. The key idea is to learn a common latent space across different views which (1) captures the semantic relationships between data items through graph embedding regularization on labeled items, and (2) allows each latent factor to be associated with a subset of views via sparseness constraints. In this way, MCL could capture flexible conceptual patterns hidden in multi-view features. Experiments on a toy problem and three real-world datasets show that MCL performs well and outperforms baseline methods.
Keywords :
data analysis; data structures; graph theory; vectors; Flickr; MCL; conceptual factors; curse of dimensionality problem; data analytic applications; data representation; feature vectors; graph embedding regularization; mult-view concept learning; multiview concept learning; nonnegative latent representation learning algorithm; semantic relationships; statistical properties; user contributed tags; Electronic mail; Encoding; Linear programming; Optimization methods; Semantics; Visualization; Graph Embedding; Multi-view Learning; Multi-view learning; Nonnegative Matrix Factorization; Structured Sparsity; graph embedding; nonnegative matrix factorization; structured sparsity;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/TKDE.2015.2448542
Filename :
7130644
Link To Document :
بازگشت