DocumentCode :
2201043
Title :
Term Extraction and Disambiguation for Semantic Knowledge Enrichment: A Case Study on Initial Public Offering (IPO) Prospectus Corpus
Author :
Jie Tao ; El-Gayar, Omar F. ; Deokar, Amit V. ; Yenling Chang
fYear :
2015
fDate :
5-8 Jan. 2015
Firstpage :
3719
Lastpage :
3728
Abstract :
Domain knowledge bases are a basis for advanced knowledge-based systems, manually creating a formal knowledge base for a certain domain is both resource consuming and non-trivial. In this paper, we propose an approach that provides support to extract, select, and disambiguate terms embedded in domain specific documents. The extracted terms are later used to en-rich existing ontologies/taxonomies, as well as to bridge domain specific knowledge base with a generic knowledge base such as Word Net. The proposed approach addresses two major issues in the term extraction domain, namely quality and efficiency. Also, the proposed approach adopts a feature-based method that assists in topic extraction and integration with existing ontologies in the given domain. The proposed approach is realized in a research prototype, and then a case study is conducted in order to illustrate the feasibility and the efficiency of the proposed method in the finance domain. A preliminary empirical validation by the domain experts is also conducted to determine the accuracy of the proposed approach. The results from the case study indicate the advantages and potential of the proposed approach.
Keywords :
data mining; investment; knowledge based systems; natural language processing; ontologies (artificial intelligence); IPO prospectus corpus; Word Net; feature-based method; initial public offering; knowledge-based system; ontologies; semantic knowledge enrichment; term disambiguation; term extraction; topic extraction; Equations; Feature extraction; Knowledge based systems; Logic gates; Ontologies; Semantics; Syntactics; Initial Public Offering; ontology enrichment; text analytics; word sense disambiguation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
System Sciences (HICSS), 2015 48th Hawaii International Conference on
Conference_Location :
Kauai, HI
ISSN :
1530-1605
Type :
conf
DOI :
10.1109/HICSS.2015.448
Filename :
7070264
Link To Document :
بازگشت