DocumentCode
243540
Title
Supervised Adaptive-Transfer PLSA for Cross-Domain Text Classification
Author
Rui Zhao ; Kezhi Mao
Author_Institution
Sch. of Electr. & Electron. Eng., Nanyang Technol. Univ., Singapore, Singapore
fYear
2014
fDate
14-14 Dec. 2014
Firstpage
259
Lastpage
266
Abstract
Cross-domain learning is a promising technique for improving classification in a target (testing) domain whose data distribution differs substantially from that of the source (training) domain. Many cross-domain text classification methods are built on topic modeling. However, topic models are unsupervised by nature and do not fully utilize the label information of the source domain. In addition, almost all cross-domain learning approaches use the knowledge of the source domain in the later stage of the training process, which limits knowledge transfer. In this paper, we propose a model named Supervised Adaptive-transfer Probabilistic Latent Semantic Analysis (SAtPLSA) for cross-domain text classification that addresses these two issues. The proposed model extends the original PLSA to a supervised learning paradigm. By defining the common labeled information from each term across domains, we transfer knowledge from the source domain to assist in classifying text in the target domain. In addition, we adaptively modify the weight that controls the proportion of source-domain knowledge used in the model learning process. Finally, we conducted experiments on nine benchmark datasets for cross-domain text classification to compare the performance of the proposed algorithm with two classical supervised learning methods and five state-of-the-art transfer learning approaches. The experimental results demonstrate the effectiveness and efficiency of the proposed SAtPLSA algorithm.
Keywords
learning (artificial intelligence); pattern classification; probability; text analysis; SAtPLSA; cross-domain learning; cross-domain text classification; data distributions; knowledge transfer; source domain; supervised adaptive transfer probabilistic latent semantic analysis; supervised adaptive-transfer PLSA; supervised learning; target domain; training process; Accuracy; Adaptation models; Analytical models; Knowledge transfer; Semantics; Testing; Training; Cross-Domain Learning; PLSA; Text Classification;
fLanguage
English
Publisher
IEEE
Conference_Titel
Data Mining Workshop (ICDMW), 2014 IEEE International Conference on
Conference_Location
Shenzhen
Print_ISBN
978-1-4799-4275-6
Type
conf
DOI
10.1109/ICDMW.2014.163
Filename
7022606