Title :
Mining lexico-syntactic patterns to extract chemical entities with their associated properties
Author :
Eltyeb, Safaa ; Salim, Naomie
Author_Institution :
Fac. of Comput., Univ. Teknol. Malaysia, Skudai, Malaysia
Abstract :
Specific information on newly discovered compound is often difficult to be found in chemical databases. The chemical and drug literature is very rich with the information resulted from new chemical synthesis. This paper presents a survey on the types of approaches that have been used to extract information associated with chemical compounds from chemical and drug text. Thereafter, it gives a description for a novel pattern-based extraction method to be developed in the future taking into account specific types of information associated with chemical compounds not explored before in the automated extraction from a text. The paper focuses on the extraction of the properties that influence the bioavailability of drug candidates´ compounds. The result of this study can help the database curators in compiling the drug related chemical databases and the researchers to digest the huge amount of textual information which is growing rapidly.
Keywords :
chemistry computing; data mining; drugs; information retrieval; text analysis; automated extraction; bioavailability; chemical compounds; chemical entity extraction; chemical text; drug candidates compounds; drug text; lexico-syntactic pattern mining; pattern-based extraction method; Chemical compounds; Chemicals; Compounds; Data mining; Databases; Drugs; Information retrieval; Information extraction; chemical compounds; chemical databases; pattern-based approach;
Conference_Titel :
Computing, Electrical and Electronics Engineering (ICCEEE), 2013 International Conference on
Conference_Location :
Khartoum
Print_ISBN :
978-1-4673-6231-3
DOI :
10.1109/ICCEEE.2013.6633957