Title :
An unsupervised method to extract Chinese comment targets automatically
Author :
Xiangrong Quan ; Guoshi Wu ; Zilin He ; Lu Huang
Author_Institution :
Sch. of Software, Beijing Univ. of Posts & Telecommun., Beijing, China
Abstract :
With the Internet developing, the quantity of comments accumulates rapidly, and sentiment analysis becomes a hot research task. A key preparation for sentiment analysis is to extract comment targets. In this paper an unsupervised method is proposed to judging the degree of their importance. We separate the method into two parts. One part uses explicit phrases´ cooccurrence, calculating conditional entropy. We take nouns and adjectives into consideration, and make full use of their relevance. The other part mines the relationship between clusters and terms based on LDA result, using a modified PageRank method. The two algorithms are combined to rescore these candidate targets. In the end, we use two ways to test our LPCE (LDA, PageRank, and Condition Entropy) model´s reliability, including word aspect and sentence aspect. Experiment results prove LPCE model is reasonable and effective.
Keywords :
Internet; natural language processing; search engines; Chinese comment; Internet; LDA; LPCE; PageRank method; condition entropy; conditional entropy; model reliability; sentiment analysis; unsupervised method; Algorithm design and analysis; Clustering algorithms; Educational institutions; Entropy; Feature extraction; Heuristic algorithms; Sentiment analysis; LDA; NLP; PageRank; TF-IDF; conditional entropy;
Conference_Titel :
Software Engineering and Service Science (ICSESS), 2014 5th IEEE International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4799-3278-8
DOI :
10.1109/ICSESS.2014.6933589