• DocumentCode
    3603258
  • Title

    Multi-View Concept Learning for Data Representation

  • Author

    Ziyu Guan ; Lijun Zhang ; Jinye Peng ; Jianping Fan

  • Author_Institution
    Coll. of Inf. & Technol., Northwest Univ. of China, Xian, China
  • Volume
    27
  • Issue
    11
  • fYear
    2015
  • Firstpage
    3016
  • Lastpage
    3028
  • Abstract
    Real-world datasets often involve multiple views of data items, e.g., a Web page can be described by both its content and anchor texts of hyperlinks leading to it; photos in Flickr could be characterized by visual features, as well as user contributed tags. Different views provide information complementary to each other. Synthesizing multi-view features can lead to a comprehensive description of the data items, which could benefit many data analytic applications. Unfortunately, the simple idea of concatenating different feature vectors ignores statistical properties of each view and usually incurs the “curse of dimensionality” problem. We propose Multi-view Concept Learning (MCL), a novel nonnegative latent representation learning algorithm for capturing conceptual factors from multi-view data. MCL exploits both multi-view information and label information. The key idea is to learn a common latent space across different views which (1) captures the semantic relationships between data items through graph embedding regularization on labeled items, and (2) allows each latent factor to be associated with a subset of views via sparseness constraints. In this way, MCL could capture flexible conceptual patterns hidden in multi-view features. Experiments on a toy problem and three real-world datasets show that MCL performs well and outperforms baseline methods.
  • Keywords
    data analysis; data structures; graph theory; vectors; Flickr; MCL; conceptual factors; curse of dimensionality problem; data analytic applications; data representation; feature vectors; graph embedding regularization; mult-view concept learning; multiview concept learning; nonnegative latent representation learning algorithm; semantic relationships; statistical properties; user contributed tags; Electronic mail; Encoding; Linear programming; Optimization methods; Semantics; Visualization; Graph Embedding; Multi-view Learning; Multi-view learning; Nonnegative Matrix Factorization; Structured Sparsity; graph embedding; nonnegative matrix factorization; structured sparsity;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2015.2448542
  • Filename
    7130644