• DocumentCode
    243580
  • Title

    A Semi-Supervised Self-Adaptive Classifier over Opinionated Streams

  • Author

    Zimmerann, Max ; Ntoutsi, Eirini ; Spiliopoulou, Myra

  • Author_Institution
    Fac. of Comput. Sci., Otto-von-Guericke-Univ. of Magdeburg, Magdeburg, Germany
  • fYear
    2014
  • fDate
    14-14 Dec. 2014
  • Firstpage
    425
  • Lastpage
    432
  • Abstract
    We investigate the problem of polarity learning over a stream of opinionated documents. We deal with two challenges. First, if the opinions are not labeled, then we cannot assume that a human expert will be regularly and frequently available to assess the sentiment of arriving documents for learning and model adaption. Further, the vocabulary of the stream, and thus the feature space used for learning, changesover time: people use an abundancy of words, and sometime seven invent new ones to express their feelings. We propose a semi-supervised opinion stream classification algorithm that uses only an initial training set of labeled documents for polarity learning and gradually adapts to changes in the vocabulary. In particular, our algorithm S*3 Learner starts with the vocabulary of opinionated words that are in the documents of the initial training set, and then expands it with new words, as soon as there is enough evidence for estimating their polarity. We study the performance of S*3 Learneron opinionated streams under the natural order of document arrival and under a modified ordering that allows us to simulate vocabulary evolution.
  • Keywords
    classification; document handling; expert systems; learning (artificial intelligence); vocabulary; S*3 learner; arriving document; document arrival; feature space; human expert; labeled document; learning adaptation; model adaption; opinionated document; opinionated stream; opinionated word; polarity learning; semisupervised opinion stream classification algorithm; semisupervised self-adaptive classifier; vocabulary evolution; Entropy; Estimation; Integrated circuits; Sentiment analysis; Training; Twitter; Vocabulary; Big Data Streams; Semi-Supervised Stream Learning; Stream Classification; Twitter Sentiment Analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining Workshop (ICDMW), 2014 IEEE International Conference on
  • Conference_Location
    Shenzhen
  • Print_ISBN
    978-1-4799-4275-6
  • Type

    conf

  • DOI
    10.1109/ICDMW.2014.106
  • Filename
    7022627