• DocumentCode
    243540
  • Title

    Supervised Adaptive-Transfer PLSA for Cross-Domain Text Classification

  • Author

    Rui Zhao ; Kezhi Mao

  • Author_Institution
    Sch. of Electr. & Electron. Eng., Nanyang Technol. Univ., Singapore, Singapore
  • fYear
    2014
  • fDate
    14-14 Dec. 2014
  • Firstpage
    259
  • Lastpage
    266
  • Abstract
    Cross-domain learning is a very promising technique to improve classification in the target (testing) domain whose data distributions are very different from the source (training) domain. Many cross-domain text classification methods are built on topic modeling approaches. However, topic model methods are unsupervised in nature without fully utilizing the label information of the source domain. In addition, almost all cross-domain learning approaches utilize the knowledge of source domain in the later stage of the training process, and this limits the knowledge transfer. In this paper, we propose a model named Supervised Adaptive transfer Probabilistic Latent Semantic Analysis (SAtPLSA) for cross-domain text classification aiming to deal with the above two issues. The proposed model extends the original PLSA to a supervised learning paradigm. By defining the common labeled information from each term across domains, we transfer knowledge in source domain to assist classifying text in target domain. In addition, we adaptively modify the weight value controlling the proportion of the usage of knowledge from source domain in the model learning process. At last, we conducted experiments on nine benchmark datasets in cross domain text classification to compare the performance of our proposed algorithm with two classical supervised learning methods and five state-of-art transfer learning approaches. The experimental results have shown the effectiveness and efficiency of our proposed SAtPLSA algorithm.
  • Keywords
    learning (artificial intelligence); pattern classification; probability; text analysis; SAtPLSA; cross-domain learning; cross-domain text classification; data distributions; knowledge transfer; source domain; supervised adaptive transfer probabilistic latent semantic analysis; supervised adaptive-transfer PLSA; supervised learning; target domain; training process; Accuracy; Adaptation models; Analytical models; Knowledge transfer; Semantics; Testing; Training; Cross-Domain Learning; PLSA; Text Classification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining Workshop (ICDMW), 2014 IEEE International Conference on
  • Conference_Location
    Shenzhen
  • Print_ISBN
    978-1-4799-4275-6
  • Type

    conf

  • DOI
    10.1109/ICDMW.2014.163
  • Filename
    7022606