Title :
Language model based Chinese financial news sentiment classification
Author :
Xu, Jun ; Xu, Rui-feng ; Wang, Xiao-long
Author_Institution :
Dept. of Comput. Sci. & Technol., Harbin Inst. of Technol. Shenzhen Grad. Sch., Harbin, China
Abstract :
This paper address the problem of identifying the sentiment polarity in financial news articles about a public company having potential effect on the future price of the company´s stock. The problem is challenging due to the lack of reliable labeled training data and effective classification method. A feasible corpus building strategy is proposed and stock reviews are used for training, since the news polarity prediction is similar to the process of stock analyst drawing their conclusion by weighting the major event pros and cons of the company. The reviews can be annotated automatically by the grade given by the analyst. In addition, the consequent experiments also confirm it. Furthermore, we examine the effectiveness of using language modeling approaches to solve the sentiment classification of Chinese financial news articles. Two different approaches based on language model are employed and their comparisons with SVM and Naive Bayes are also performed in our research. The experiment results justify the effectiveness and robustness of the proposed language model approaches, which perform better than the approaches based on traditional machine learning techniques.
Keywords :
classification; financial data processing; learning (artificial intelligence); natural language processing; public administration; stock markets; support vector machines; Chinese financial news articles; SVM; classification method; company stock; corpus building strategy; language model approaches; language model based Chinese financial news sentiment classification; language modeling approaches; machine learning techniques; naive Bayes; news polarity prediction; public company; reliable labeled training data; robustness; sentiment polarity; stock analyst; stock reviews; Abstracts; Investments; Market research; Measurement; Niobium; Noise; Support vector machines; Financial text analysis; Language model; Probabilistic measure; Sentiment analysis;
Conference_Titel :
Machine Learning and Cybernetics (ICMLC), 2012 International Conference on
Conference_Location :
Xian
Print_ISBN :
978-1-4673-1484-8
DOI :
10.1109/ICMLC.2012.6359687