DocumentCode :
3345455
Title :
Semi-supervised Chinese contextual polarity classification with automatic feature selection
Author :
Ge Xu ; Houfeng Wang
Author_Institution :
Key Lab. of Comput. Linguistics, Peking Univ., Beijing, China
Volume :
2
fYear :
2011
fDate :
26-28 July 2011
Firstpage :
1019
Lastpage :
1023
Abstract :
Common approaches to the tasks of sentiment analysis start with a list of words with prior polarities, which are context-free. However, a word can exhibit different polarities in different contexts, which are termed as contextual polarities. In this paper, viewing polarities of words as properties of word senses, we treat the Chinese contextual polarity classification as word sense disambiguation (WSD), and manually labeled a Chinese dataset for training and testing. Due to the insufficiency of labeled data, semi-supervised methods are adopted; to find the effective features for the contextual polarity classification, two automatic feature selection algorithms are proposed. We combine the semi-supervised methods with automatic feature selection algorithms in order to utilize the strengths of both. The experimental results show that the semi-supervised methods, automatic feature selection, and the combination of both help to improve the Chinese contextual polarity classification above a supervised baseline model.
Keywords :
feature extraction; natural language processing; pattern classification; automatic feature selection; contextual polarities; labeled data insufficiency; semi supervised Chinese contextual polarity classification; sentiment analysis; supervised baseline model; viewing polarities; word sense disambiguation; Accuracy; Classification algorithms; Computational linguistics; Machine learning; Niobium; Support vector machines; Training; automatic feature selection; contextual polarity; prior polarity; semi-supervised learning; word sense disambiguation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Natural Computation (ICNC), 2011 Seventh International Conference on
Conference_Location :
Shanghai
ISSN :
2157-9555
Print_ISBN :
978-1-4244-9950-2
Type :
conf
DOI :
10.1109/ICNC.2011.6022245
Filename :
6022245
Link To Document :
بازگشت