DocumentCode
2634990
Title
Building a General Purpose Cross-Domain Sentiment Mining Model
Author
Whitehead, Matthew ; Yaeger, Larry
Author_Institution
Sch. of Inf., Indiana Univ., Bloomington, IN, USA
Volume
4
fYear
2009
fDate
March 31 2009-April 2 2009
Firstpage
472
Lastpage
476
Abstract
Building a model using machine learning that can classify the sentiment of natural language text often requires an extensive set of labeled training data from the same domain as the target text. Gathering and labeling new datasets whenever a model is needed for a new domain is time-consuming and difficult, especially if a dataset with numeric ratings is not available. In this paper we consider the problem of building models that have a high sentiment classification accuracy without the aid of a labeled dataset from the target domain. We show that ensembles of existing domain models can be used to achieve a classification accuracy that approaches that of models trained on data from the target domain.
Keywords
data mining; learning (artificial intelligence); natural language processing; pattern classification; dataset labeling; general purpose cross-domain sentiment mining model; high sentiment classification accuracy; machine learning; natural language text; Digital cameras; Drugs; Informatics; Machine learning; Motion pictures; Natural languages; Portable computers; Support vector machines; Testing; Training data; Data Mining; Machine Learning; Opinion Mining; Sentiment Mining;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Science and Information Engineering, 2009 WRI World Congress on
Conference_Location
Los Angeles, CA
Print_ISBN
978-0-7695-3507-4
Type
conf
DOI
10.1109/CSIE.2009.754
Filename
5171041
Link To Document