Title :
A Semi-Supervised Self-Adaptive Classifier over Opinionated Streams
Author :
Zimmerann, Max ; Ntoutsi, Eirini ; Spiliopoulou, Myra
Author_Institution :
Fac. of Comput. Sci., Otto-von-Guericke-Univ. of Magdeburg, Magdeburg, Germany
Abstract :
We investigate the problem of polarity learning over a stream of opinionated documents. We deal with two challenges. First, if the opinions are not labeled, then we cannot assume that a human expert will be regularly and frequently available to assess the sentiment of arriving documents for learning and model adaption. Further, the vocabulary of the stream, and thus the feature space used for learning, changesover time: people use an abundancy of words, and sometime seven invent new ones to express their feelings. We propose a semi-supervised opinion stream classification algorithm that uses only an initial training set of labeled documents for polarity learning and gradually adapts to changes in the vocabulary. In particular, our algorithm S*3 Learner starts with the vocabulary of opinionated words that are in the documents of the initial training set, and then expands it with new words, as soon as there is enough evidence for estimating their polarity. We study the performance of S*3 Learneron opinionated streams under the natural order of document arrival and under a modified ordering that allows us to simulate vocabulary evolution.
Keywords :
classification; document handling; expert systems; learning (artificial intelligence); vocabulary; S*3 learner; arriving document; document arrival; feature space; human expert; labeled document; learning adaptation; model adaption; opinionated document; opinionated stream; opinionated word; polarity learning; semisupervised opinion stream classification algorithm; semisupervised self-adaptive classifier; vocabulary evolution; Entropy; Estimation; Integrated circuits; Sentiment analysis; Training; Twitter; Vocabulary; Big Data Streams; Semi-Supervised Stream Learning; Stream Classification; Twitter Sentiment Analysis;
Conference_Titel :
Data Mining Workshop (ICDMW), 2014 IEEE International Conference on
Conference_Location :
Shenzhen
Print_ISBN :
978-1-4799-4275-6
DOI :
10.1109/ICDMW.2014.106