Title :
A Method Based on Generation Models for Analyzing Sentiment-Topic in Texts
Author :
Fan Na ; Cai Wan-dong ; Zhao Yu
Author_Institution :
Coll. of Comput. Sci., Northwestern Polytech. Univ., Xi´an, China
Abstract :
This paper proposes a method based on generation model for sentiment analysis and topic identification in texts. Firstly sentiment and topic of training texts are labeled by hand and sentiment models and topic models are established. Secondly compute the Kullback-Leibler divergence between a testing text and sentiment models in order to determine sentiment of the text. Similarly, calculate the Kullback-Leibler divergence between the testing text and topic model, so the topic of text can be identified. The unigram and bigram of words are employed as the model parameters, and correspondingly maximum likelihood estimation and some smoothing techniques are used to estimate these parameters. Empirical experiments on product reviews corpus show that this language modeling approach performs better than SVM and obtains improvement on precision. Moreover this method is better than SVM in robustness.
Keywords :
data mining; maximum likelihood estimation; text analysis; Kullback-Leibler divergence; language modeling; maximum likelihood estimation; sentiment-topic analysis; smoothing technique; topic identification; word bigram; word unigram; Computer aided instruction; Computer science; Data mining; Educational institutions; Information analysis; Machine learning algorithms; Maximum likelihood estimation; Support vector machine classification; Support vector machines; Testing;
Conference_Titel :
Computational Intelligence and Software Engineering, 2009. CiSE 2009. International Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-4507-3
Electronic_ISBN :
978-1-4244-4507-3
DOI :
10.1109/CISE.2009.5362943