Title :
Feature Distributions in Domain Adaptation
Author :
Uribe, D. ; Cuan, E.
Author_Institution :
Inst. Tecnol. de la Laguna, Torreon, Mexico
Abstract :
The adaptation problem in sentiment classification is approached in this paper. Since the availability of labeled data required by sentiment classifiers is not always possible, given a set of labeled data from different domains and a small amount of labeled data of the target domain, it would be interesting to determine which subset of those domains has a feature distribution most similar to the target domain. In this paper, we propose a meaningful examination of the overlap between two feature sets in order to obtain the most similar distribution to the target domain. The results of the experimentation show how our method is oriented to select features that have a good correlation with the target domain.
Keywords :
natural language processing; pattern classification; text analysis; feature distributions; feature selection; sentiment classification; feature distribution; labeled data; sentiment classification;
Conference_Titel :
Electronics, Robotics and Automotive Mechanics Conference (CERMA), 2012 IEEE Ninth
Conference_Location :
Cuernavaca
Print_ISBN :
978-1-4673-5096-9
DOI :
10.1109/CERMA.2012.33