DocumentCode
243580
Title
A Semi-Supervised Self-Adaptive Classifier over Opinionated Streams
Author
Zimmerann, Max ; Ntoutsi, Eirini ; Spiliopoulou, Myra
Author_Institution
Fac. of Comput. Sci., Otto-von-Guericke-Univ. of Magdeburg, Magdeburg, Germany
fYear
2014
fDate
14-14 Dec. 2014
Firstpage
425
Lastpage
432
Abstract
We investigate the problem of polarity learning over a stream of opinionated documents. We deal with two challenges. First, if the opinions are not labeled, then we cannot assume that a human expert will be regularly and frequently available to assess the sentiment of arriving documents for learning and model adaption. Further, the vocabulary of the stream, and thus the feature space used for learning, changesover time: people use an abundancy of words, and sometime seven invent new ones to express their feelings. We propose a semi-supervised opinion stream classification algorithm that uses only an initial training set of labeled documents for polarity learning and gradually adapts to changes in the vocabulary. In particular, our algorithm S*3 Learner starts with the vocabulary of opinionated words that are in the documents of the initial training set, and then expands it with new words, as soon as there is enough evidence for estimating their polarity. We study the performance of S*3 Learneron opinionated streams under the natural order of document arrival and under a modified ordering that allows us to simulate vocabulary evolution.
Keywords
classification; document handling; expert systems; learning (artificial intelligence); vocabulary; S*3 learner; arriving document; document arrival; feature space; human expert; labeled document; learning adaptation; model adaption; opinionated document; opinionated stream; opinionated word; polarity learning; semisupervised opinion stream classification algorithm; semisupervised self-adaptive classifier; vocabulary evolution; Entropy; Estimation; Integrated circuits; Sentiment analysis; Training; Twitter; Vocabulary; Big Data Streams; Semi-Supervised Stream Learning; Stream Classification; Twitter Sentiment Analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining Workshop (ICDMW), 2014 IEEE International Conference on
Conference_Location
Shenzhen
Print_ISBN
978-1-4799-4275-6
Type
conf
DOI
10.1109/ICDMW.2014.106
Filename
7022627
Link To Document