• DocumentCode
    2634990
  • Title

    Building a General Purpose Cross-Domain Sentiment Mining Model

  • Author

    Whitehead, Matthew ; Yaeger, Larry

  • Author_Institution
    Sch. of Inf., Indiana Univ., Bloomington, IN, USA
  • Volume
    4
  • fYear
    2009
  • fDate
    March 31 2009-April 2 2009
  • Firstpage
    472
  • Lastpage
    476
  • Abstract
    Building a model using machine learning that can classify the sentiment of natural language text often requires an extensive set of labeled training data from the same domain as the target text. Gathering and labeling new datasets whenever a model is needed for a new domain is time-consuming and difficult, especially if a dataset with numeric ratings is not available. In this paper we consider the problem of building models that have a high sentiment classification accuracy without the aid of a labeled dataset from the target domain. We show that ensembles of existing domain models can be used to achieve a classification accuracy that approaches that of models trained on data from the target domain.
  • Keywords
    data mining; learning (artificial intelligence); natural language processing; pattern classification; dataset labeling; general purpose cross-domain sentiment mining model; high sentiment classification accuracy; machine learning; natural language text; Digital cameras; Drugs; Informatics; Machine learning; Motion pictures; Natural languages; Portable computers; Support vector machines; Testing; Training data; Data Mining; Machine Learning; Opinion Mining; Sentiment Mining;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Science and Information Engineering, 2009 WRI World Congress on
  • Conference_Location
    Los Angeles, CA
  • Print_ISBN
    978-0-7695-3507-4
  • Type

    conf

  • DOI
    10.1109/CSIE.2009.754
  • Filename
    5171041