DocumentCode
130875
Title
An unsupervised method to extract Chinese comment targets automatically
Author
Xiangrong Quan ; Guoshi Wu ; Zilin He ; Lu Huang
Author_Institution
Sch. of Software, Beijing Univ. of Posts & Telecommun., Beijing, China
fYear
2014
fDate
27-29 June 2014
Firstpage
389
Lastpage
392
Abstract
With the Internet developing, the quantity of comments accumulates rapidly, and sentiment analysis becomes a hot research task. A key preparation for sentiment analysis is to extract comment targets. In this paper an unsupervised method is proposed to judging the degree of their importance. We separate the method into two parts. One part uses explicit phrases´ cooccurrence, calculating conditional entropy. We take nouns and adjectives into consideration, and make full use of their relevance. The other part mines the relationship between clusters and terms based on LDA result, using a modified PageRank method. The two algorithms are combined to rescore these candidate targets. In the end, we use two ways to test our LPCE (LDA, PageRank, and Condition Entropy) model´s reliability, including word aspect and sentence aspect. Experiment results prove LPCE model is reasonable and effective.
Keywords
Internet; natural language processing; search engines; Chinese comment; Internet; LDA; LPCE; PageRank method; condition entropy; conditional entropy; model reliability; sentiment analysis; unsupervised method; Algorithm design and analysis; Clustering algorithms; Educational institutions; Entropy; Feature extraction; Heuristic algorithms; Sentiment analysis; LDA; NLP; PageRank; TF-IDF; conditional entropy;
fLanguage
English
Publisher
ieee
Conference_Titel
Software Engineering and Service Science (ICSESS), 2014 5th IEEE International Conference on
Conference_Location
Beijing
ISSN
2327-0586
Print_ISBN
978-1-4799-3278-8
Type
conf
DOI
10.1109/ICSESS.2014.6933589
Filename
6933589
Link To Document