• DocumentCode
    3604358
  • Title

    Cross-Modal Subspace Learning via Pairwise Constraints

  • Author

    Ran He ; Man Zhang ; Liang Wang ; Ye Ji ; Qiyue Yin

  • Author_Institution
    Nat. Lab. of Pattern Recognition, Inst. of Autom., Beijing, China
  • Volume
    24
  • Issue
    12
  • fYear
    2015
  • Firstpage
    5543
  • Lastpage
    5556
  • Abstract
    In multimedia applications, the text and image components in a web document form a pairwise constraint that potentially indicates the same semantic concept. This paper studies cross-modal learning via the pairwise constraint and aims to find the common structure hidden in different modalities. We first propose a compound regularization framework to address the pairwise constraint, which can be used as a general platform for developing cross-modal algorithms. For unsupervised learning, we propose a multi-modal subspace clustering method to learn a common structure for different modalities. For supervised learning, to reduce the semantic gap and the outliers in pairwise constraints, we propose a cross-modal matching method based on compound ℓ21 regularization. Extensive experiments demonstrate the benefits of joint text and image modeling with semantically induced pairwise constraints, and they show that the proposed cross-modal methods can further reduce the semantic gap between different modalities and improve the clustering/matching accuracy.
  • Keywords
    Internet; image matching; multimedia computing; pattern clustering; unsupervised learning; Web document; compound ℓ21 regularization; compound regularization framework; cross-modal algorithms; cross-modal matching method; cross-modal subspace learning; image components; image modeling; multimedia applications; multimodal subspace clustering method; semantically induced pairwise constraints; supervised learning; text components; text modeling; unsupervised learning; Clustering algorithms; Clustering methods; Dictionaries; Multimedia communication; Semantics; Supervised learning; Multi modal; multi modal; pairwise constraint; subspace clustering;
  • fLanguage
    English
  • Journal_Title
    Image Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1057-7149
  • Type

    jour

  • DOI
    10.1109/TIP.2015.2466106
  • Filename
    7182358