Title :
Extraction of drug-disease relations from MEDLINE abstracts
Author :
Bchir, Aida ; Ben Abdessalem Karaa, Wahiba
Author_Institution :
High Inst. of Manage., Tunisian Univ., Tunis, Tunisia
Abstract :
Biological research literature, as in many other domains of human activity, is a rich source of knowledge. MEDLINE is a huge database of biomedical information and life sciences; it provides information in the form of abstracts and documents. However, extracting this information leads to various problems, related to the types of information such as recognition of all terms related to the domain of texts, concepts associated with them, as well as identifying the types of relationships. In this context, we suggest in this paper an approach to extract disease-drug relations: in a first step, we employ Natural Language Processing techniques for the abstracts´ preprocessing. In a second step we extract a set of features from the preprocessed abstracts. Finally we extract a disease-drug relation using machine learning classifier.
Keywords :
diseases; drugs; feature extraction; learning (artificial intelligence); medical information systems; natural language processing; Biological research literature; MEDLINE abstracts; biomedical information; drug disease relation extraction; feature extraction; human activity; knowledge source; machine learning classifier; natural language processing techniques; Abstracts; Data mining; Diseases; Drugs; Feature extraction; Protein engineering; Proteins;
Conference_Titel :
Computer and Information Technology (WCCIT), 2013 World Congress on
Conference_Location :
Sousse
Print_ISBN :
978-1-4799-0460-0
DOI :
10.1109/WCCIT.2013.6618759