DocumentCode
2319430
Title
Language model based Chinese financial news sentiment classification
Author
Xu, Jun ; Xu, Rui-feng ; Wang, Xiao-long
Author_Institution
Dept. of Comput. Sci. & Technol., Harbin Inst. of Technol. Shenzhen Grad. Sch., Harbin, China
Volume
5
fYear
2012
fDate
15-17 July 2012
Firstpage
2025
Lastpage
2030
Abstract
This paper address the problem of identifying the sentiment polarity in financial news articles about a public company having potential effect on the future price of the company´s stock. The problem is challenging due to the lack of reliable labeled training data and effective classification method. A feasible corpus building strategy is proposed and stock reviews are used for training, since the news polarity prediction is similar to the process of stock analyst drawing their conclusion by weighting the major event pros and cons of the company. The reviews can be annotated automatically by the grade given by the analyst. In addition, the consequent experiments also confirm it. Furthermore, we examine the effectiveness of using language modeling approaches to solve the sentiment classification of Chinese financial news articles. Two different approaches based on language model are employed and their comparisons with SVM and Naive Bayes are also performed in our research. The experiment results justify the effectiveness and robustness of the proposed language model approaches, which perform better than the approaches based on traditional machine learning techniques.
Keywords
classification; financial data processing; learning (artificial intelligence); natural language processing; public administration; stock markets; support vector machines; Chinese financial news articles; SVM; classification method; company stock; corpus building strategy; language model approaches; language model based Chinese financial news sentiment classification; language modeling approaches; machine learning techniques; naive Bayes; news polarity prediction; public company; reliable labeled training data; robustness; sentiment polarity; stock analyst; stock reviews; Abstracts; Investments; Market research; Measurement; Niobium; Noise; Support vector machines; Financial text analysis; Language model; Probabilistic measure; Sentiment analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Cybernetics (ICMLC), 2012 International Conference on
Conference_Location
Xian
ISSN
2160-133X
Print_ISBN
978-1-4673-1484-8
Type
conf
DOI
10.1109/ICMLC.2012.6359687
Filename
6359687
Link To Document