Title :
CiteXtract: Extracting Citation Data from Biomedical Literature
Author :
Nikolov, Nikolay ; Stoehr, Peter ; Zhu, Weimin ; Rijnbeek, Mark ; Pillai, Sharmila
Author_Institution :
Eur. Bioinformatics Inst., Cambridge
Abstract :
We present a system for extracting citation data from Pubmed-indexed papers available online based on a knowledge-based algorithm. We achieve nearly 92% accuracy on a sample of 156 papers from 78 different journals. We describe the issues faced, our approach, the results achieved and the future directions of our work.
Keywords :
database management systems; knowledge based systems; medical information systems; search engines; CiteXtract; Pubmed-indexed papers; biomedical literature; citation data extraction; knowledge-based algorithm; Bibliographies; Bioinformatics; Books; Data mining; Genomics; HTML; Hydrogen; Navigation;
Conference_Titel :
Computer-Based Medical Systems, 2007. CBMS '07. Twentieth IEEE International Symposium on
Conference_Location :
Maribor
Print_ISBN :
0-7695-2905-4
DOI :
10.1109/CBMS.2007.31