DocumentCode :
18102
Title :
CLOpinionMiner: Opinion Target Extraction in a Cross-Language Scenario
Author :
Xinjie Zhou ; Xiaojun Wan ; Jianguo Xiao
Author_Institution :
MOE Key Lab. of Comput. Linguistics, Peking Univ., Beijing, China
Volume :
23
Issue :
4
fYear :
2015
fDate :
Apr-15
Firstpage :
619
Lastpage :
630
Abstract :
Opinion target extraction is a subtask of opinion mining which is very useful in many applications. The problem has usually been solved by training a sequence labeler on manually labeled data. However, the labeled training datasets are imbalanced in different languages, and the lack of labeled corpus in a language limits the research progress on opinion target extraction in this language. In order to address the above problem, we propose a novel system called CLOpinionMiner which investigates leveraging the rich labeled data in a source language for opinion target extraction in a different target language. In this study, we focus on English-to-Chinese cross-language opinion target extraction. Based on the English dataset, our method produces two Chinese training datasets with different features. Two labeling models for Chinese opinion target extraction are trained based on Conditional Random Fields (CRF). After that, we use a monolingual co-training algorithm to improve the performance of both models by leveraging the enormous unlabeled Chinese review texts on the web. Experimental results show the effectiveness of our proposed approach.
Keywords :
data mining; language translation; linguistics; natural language processing; random processes; text analysis; CLOpinionMiner; CRF; Chinese review texts; Chinese training datasets; English dataset; English-to-Chinese cross-language opinion target extraction; conditional random fields; cross-language scenario; labeled corpus; labeled data; labeled training datasets; labeling models; monolingual co-training algorithm; opinion mining; sequence labeler training; Cameras; Data mining; Data models; Feature extraction; Information retrieval; Labeling; Training; Cross-language information extraction; opinion mining; opinion target extraction;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE/ACM Transactions on
Publisher :
ieee
ISSN :
2329-9290
Type :
jour
DOI :
10.1109/TASLP.2015.2392381
Filename :
7009977
Link To Document :
بازگشت