• DocumentCode
    1256942
  • Title

    Multi-Label Transfer Learning With Sparse Representation

  • Author

    Han, Yahong ; Wu, Fei ; Zhuang, Yueting ; He, Xiaofei

  • Author_Institution
    Coll. of Comput. Sci., Zhejiang Univ., Hangzhou, China
  • Volume
    20
  • Issue
    8
  • fYear
    2010
  • Firstpage
    1110
  • Lastpage
    1121
  • Abstract
    Due to the visually polysemous barrier, videos and images may be annotated by multiple tags. Discovering the correlations among different tags can significantly help predicting precise labels for videos and images. Many of recent studies toward multi-label learning construct a linear subspace embedding with encoded multi-label information, such that data points sharing many common labels tend to be close to each other in the embedded subspace. Motivated by the advances of compressive sensing research, a sparse representation that selects a compact subset to describe the input data can be more discriminative. In this paper, we propose a sparse multi-label learning method to circumvent the visually polysemous barrier of multiple tags. Our approach learns a multi-label encoded sparse linear embedding space from a related dataset, and maps the target data into the learned new representation space to achieve better annotation performance. Instead of using l1-norm penalty (lasso) to induce sparse representation, we propose to formulate the multi-label learning as a penalized least squares optimization problem with elastic-net penalty. By casting the video concept detection and image annotation tasks into a sparse multi-label transfer learning framework in this paper, ridge regression, lasso, elastic net, and the multi-label extended sparse discriminant analysis methods are, respectively, well explored and compared.
  • Keywords
    image retrieval; learning (artificial intelligence); optimisation; sparse matrices; video signal processing; image annotation tasks; least squares optimization problem; linear subspace; multilabel transfer learning; sparse representation; video concept detection; visually polysemous barrier; Image annotation; multi-label learning; sparse representation; transfer learning; video concept detection;
  • fLanguage
    English
  • Journal_Title
    Circuits and Systems for Video Technology, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1051-8215
  • Type

    jour

  • DOI
    10.1109/TCSVT.2010.2057015
  • Filename
    5523913