DocumentCode :
3439411
Title :
Learning the Roles of Directional Expressions and Domain Concepts in Financial News Analysis
Author :
Malo, Pedro ; Sinha, Aloka ; Takala, Pyry ; Ahlgren, Oskar ; Lappalainen, Iivari
Author_Institution :
Sch. of Bus., Dept. of Inf. & Service Econ., Aalto Univ., Aalto, Finland
fYear :
2013
fDate :
7-10 Dec. 2013
Firstpage :
945
Lastpage :
954
Abstract :
Rapid development of natural language processing technologies has paved way for automatic sentiment analysis and emergence of robo-readers in computational finance. However, the technology is still in its nascent state. Distilling sentiment information from unstructured sources has turned out to be a complicated and strongly domain-dependent problem. To emulate the human ability to recognize financial sentiments in natural language by using machines, we need to provide them with (i) necessary ontological knowledge on the relevant domain-concepts, and (ii) learning strategies that help the machines to combine this knowledge with the syntactic structures extracted from text. In this paper, we present a knowledge-driven tree kernel framework for sentence-level analysis of financial news sentiments. Comparisons with linear kernels and classical lexicon-based systems suggest that significant performance gains can be achieved by incorporating information on financial concepts and their grammatical context. The framework is decomposable into learning, knowledge and syntactic structure components. Contribution of each part is separately examined using a human-annotated phrase-bank with close to 5000 sentences collected across a number of financial news sources. The proposed sentiment analysis framework is flexible and can be applied also outside financial domain. To evaluate cross-domain performance, a further comparison of the algorithms is done with datasets from non-financial domains including movie reviews and general political discussions.
Keywords :
financial data processing; learning (artificial intelligence); natural language processing; ontologies (artificial intelligence); automatic sentiment analysis; computational finance; cross-domain performance; directional expressions; domain concepts; domain-dependent problem; financial news analysis; general political discussions; grammatical context; human-annotated phrase-bank; knowledge-driven tree kernel framework; learning strategies; lexicon-based systems; movie reviews; natural language processing; non-financial domains; ontological knowledge; robo-readers; syntactic structures; Context; Data mining; Databases; Economics; Kernel; Semantics; Syntactics; computational finance; polarity; sentiment analysis; support vector machines; tree kernels;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining Workshops (ICDMW), 2013 IEEE 13th International Conference on
Conference_Location :
Dallas, TX
Print_ISBN :
978-1-4799-3143-9
Type :
conf
DOI :
10.1109/ICDMW.2013.36
Filename :
6754023
Link To Document :
بازگشت