DocumentCode
3345455
Title
Semi-supervised Chinese contextual polarity classification with automatic feature selection
Author
Ge Xu ; Houfeng Wang
Author_Institution
Key Lab. of Comput. Linguistics, Peking Univ., Beijing, China
Volume
2
fYear
2011
fDate
26-28 July 2011
Firstpage
1019
Lastpage
1023
Abstract
Common approaches to the tasks of sentiment analysis start with a list of words with prior polarities, which are context-free. However, a word can exhibit different polarities in different contexts, which are termed as contextual polarities. In this paper, viewing polarities of words as properties of word senses, we treat the Chinese contextual polarity classification as word sense disambiguation (WSD), and manually labeled a Chinese dataset for training and testing. Due to the insufficiency of labeled data, semi-supervised methods are adopted; to find the effective features for the contextual polarity classification, two automatic feature selection algorithms are proposed. We combine the semi-supervised methods with automatic feature selection algorithms in order to utilize the strengths of both. The experimental results show that the semi-supervised methods, automatic feature selection, and the combination of both help to improve the Chinese contextual polarity classification above a supervised baseline model.
Keywords
feature extraction; natural language processing; pattern classification; automatic feature selection; contextual polarities; labeled data insufficiency; semi supervised Chinese contextual polarity classification; sentiment analysis; supervised baseline model; viewing polarities; word sense disambiguation; Accuracy; Classification algorithms; Computational linguistics; Machine learning; Niobium; Support vector machines; Training; automatic feature selection; contextual polarity; prior polarity; semi-supervised learning; word sense disambiguation;
fLanguage
English
Publisher
ieee
Conference_Titel
Natural Computation (ICNC), 2011 Seventh International Conference on
Conference_Location
Shanghai
ISSN
2157-9555
Print_ISBN
978-1-4244-9950-2
Type
conf
DOI
10.1109/ICNC.2011.6022245
Filename
6022245
Link To Document