Title :
Leaning to Train: Linking Financial News Articles to Company Short Names
Author :
Yunqing Xia ; Haizhou Lin ; Yi Liu ; Lau, Roslyn
Author_Institution :
Dept. of Comp. Sci. & Tech., Tsinghua Univ., Beijing, China
Abstract :
As a special type of named entity, company name is frequently mentioned in financial news articles, leading to significant necessity on company-oriented information retrieval and management. However, company names are usually mentioned with short names, which are sometimes ambiguous. For example, apple refers in some cases to Apple Incorporation while in other cases to a kind of sweet fruit. This motivates our research on linking financial news articles to company short name, which aims to determine whether a mention in an article is short name of a company. The supervised approach requires labor on annotation of news article that mention the specific company short name. It is rather unpractical as new company short names appear constantly. In this work, we propose a self-contained unsupervised learning framework, which relies on probabilistic topic model to collect training data automatically. Experimental results show that the performance is close to the state-of-the-art supervised approach which relies on human-judged gold standard.
Keywords :
information management; information retrieval; unsupervised learning; Apple Incorporation; company-oriented information management; company-oriented information retrieval; link financial news articles; probabilistic topic model; self-contained unsupervised learning framework; supervised approach; Companies; Data models; Gold; Joining processes; Standards; Support vector machines; Training data; Company short name; disambiguation; entity linking;
Conference_Titel :
e-Business Engineering (ICEBE), 2014 IEEE 11th International Conference on
Conference_Location :
Guangzhou
Print_ISBN :
978-1-4799-6562-5
DOI :
10.1109/ICEBE.2014.48