Title :
OnTheFly 2.0: A tool for automatic annotation of files and biological information extraction
Author :
Pafilis, Evangelos ; Pavlopoulos, Georgios A. ; Satagopam, Venkata P. ; Papanikolaou, N. ; Horn, Heiko ; Arvanitidis, Christos ; Jensen, Lars Juhl ; Schneider, R. ; Iliopoulos, Ioannis
Author_Institution :
Inst. of Marine Biol., Biotechnol. & Aquacultures (IMBBC), Hellenic Center for Marine Res. (HCMR), Heraklion, Greece
Abstract :
Retrieving all of the necessary information from databases about bioentities mentioned in an article is not a trivial or an easy task. Following the daily literature about a specific biological topic and collecting all the necessary information about the bioentities mentioned in the literature manually is tedious and time consuming. OnTheFly 2.0 is a web application mainly designed for non-computer experts which aims to automate data collection and knowledge extraction from biological literature in a user friendly and efficient way. OnTheFly 2.0 is able to extract bioentities from individual articles such as text, Microsoft Word, Excel and PDF files. With a simple drag-and-drop motion, the text of a document is extensively parsed for bioentities such as protein/gene names and chemical compound names. Utilizing high quality data integration platforms, OnTheFly allows the generation of informative summaries, interaction networks and at-a-glance popup windows containing knowledge related to the bioentities found in documents. OnTheFly 2.0 provides a concise application to automate the extraction of bioentities hidden in various documents and is offered as a web based application. It can be found at: http://onthefly.embl.de, http://onthefly.med.uoc.gr or http://onthefly.hcmr.gr.
Keywords :
Internet; biology computing; data integration; genetics; information retrieval; knowledge acquisition; text analysis; Microsoft Excel files; Microsoft Word files; OnTheFly 2.0; PDF files; Web application; at-a-glance popup window generation; automatic file annotation; bioentity extraction; biological information extraction; biological topic; chemical compound names; data collection automation; data integration platforms; drag-and-drop motion; gene names; informative summary generation; interaction network generation; knowledge extraction automation; protein names; Bioinformatics; Databases; Electronic mail; Organisms; Protein engineering; Proteins;
Conference_Titel :
Bioinformatics and Bioengineering (BIBE), 2013 IEEE 13th International Conference on
Conference_Location :
Chania
DOI :
10.1109/BIBE.2013.6701679