• DocumentCode
    3255223
  • Title

    Deep Transfer Learning via Restricted Boltzmann Machine for Document Classification

  • Author

    Zhang, Jian

  • Author_Institution
    CS Dept., Louisiana State Univ., Baton Rouge, LA, USA
  • Volume
    1
  • fYear
    2011
  • fDate
    18-21 Dec. 2011
  • Firstpage
    323
  • Lastpage
    326
  • Abstract
    Transfer learning aims to improve a targeted learning task using other related auxiliary learning tasks and data. Most current transfer-learning methods focus on scenarios where the auxiliary and the target learning tasks are very similar: either (some of) the auxiliary data can be directly used as training examples for the target task or the auxiliary and the target data share the same representation. However, in many cases the connection between the auxiliary and the target tasks can be remote. Only a few features derived from the auxiliary data may be helpful for the target learning. We call such scenario the deep transfer-learning scenario and we introduce a novel transfer-learning method for deep transfer. Our method uses restricted Boltzmann machine to discover a set of hierarchical features from the auxiliary data. We then select from these features a subset that are helpful for the target learning, using a selection criterion based on the concept of kernel-target alignment. Finally, the target data are augmented with the selected features before training. Our experiment results show that this transfer method is effective. It can improve classification accuracy by up to more than 10%, even when the connection between the auxiliary and the target tasks is not apparent.
  • Keywords
    Boltzmann machines; document handling; learning (artificial intelligence); Boltzmann machine; document classification; hierarchical feature; kernel-target alignment; transfer learning; Accuracy; Data models; Kernel; Machine learning; Random variables; Training; Vectors; document classification; restricted Boltzmann machine; transfer learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Applications and Workshops (ICMLA), 2011 10th International Conference on
  • Conference_Location
    Honolulu, HI
  • Print_ISBN
    978-1-4577-2134-2
  • Type

    conf

  • DOI
    10.1109/ICMLA.2011.51
  • Filename
    6146992