DocumentCode
1966915
Title
A knowledge-based approach to citation extraction
Author
Day, Min-Yuh ; Tsai, Tzong-Han ; Sung, Cheng-Lung ; Lee, Cheng-Wei ; Wu, Shih-Hung ; Ong, Chorng-Shyong ; Hsu, Wen-Lian
Author_Institution
Inst. of Inf. Sci., Acad. Sinica, Taipei, Taiwan
fYear
2005
fDate
15-17 Aug. 2005
Firstpage
50
Lastpage
55
Abstract
Integration of the bibliographical information of scholarly publications available on the Internet is an important task in academic research. To accomplish this task, accurate reference metadata extraction for scholarly publications is essential for the integration of information from heterogeneous reference sources. In this paper, we propose a knowledge-based approach to literature mining and focus on reference metadata extraction methods for scholarly publications. We adopt an ontological knowledge representation framework called INFOMAP to automatically extract the reference metadata. The experimental results show that, by using INFOMAP, we can extract author, title, journal, volume, number (issue), year, and page information from different reference styles with a high degree of accuracy. The overall average field accuracy of citation extraction for a bioinformatics dataset is 97.87% for six reference styles.
Keywords
Internet; bibliographies; citation analysis; data mining; electronic publishing; information retrieval; literature; meta data; ontologies (artificial intelligence); INFOMAP; Internet; bibliographical information; citation extraction; information integration; knowledge-based approach; literature mining; ontological knowledge representation; reference metadata extraction; scholarly publication; Computer science; Data mining; Databases; Information management; Information science; Internet; Knowledge engineering; Knowledge representation; Ontologies; Particle separators;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Reuse and Integration, Conf, 2005. IRI -2005 IEEE International Conference on.
Print_ISBN
0-7803-9093-8
Type
conf
DOI
10.1109/IRI-05.2005.1506448
Filename
1506448
Link To Document