Title :
PPIExtractor: A protein-protein interaction Extractor for biomédical literature
Author :
Yang, Zhihao ; Zhao, Zhehuan ; Li, Yanpeng ; Hu, Yuncui ; Lin, Hongfei
Author_Institution :
Coll. of Comput. Sci. & Technol., Dalian Univ. of Technol., Dalian, China
Abstract :
Knowledge about protein-protein interactions (PPIs) unveils the molecular mechanisms of biological processes. In this paper, we present a PPI extraction system, termed PPIExtractor, which automatically extracts PPIs from biomedical text and visualizes them. Given a Medline record dataset, PPIExtractor first applies Feature Coupling Generalization (FCG) to tag protein names, next uses the extended semantic similarity-based method to normalize them, then combines feature-based, convolution tree and graph kernels to extract PPIs, and finally visualizes the PPI network. Experimental evaluations show that PPIExtractor can achieve state-of-the-art performance on a DIP subset with respect to comparable evaluations. PPIExtractor is freely available for academic purposes at: http://202.118.75.18:8080/PPIExtractor/.
Keywords :
bioinformatics; biological techniques; computational linguistics; data visualisation; decision trees; graph theory; molecular biophysics; natural language processing; proteins; semantic networks; text analysis; word processing; Medline record dataset; PPI extraction system; PPIExtractor; biological process; biomedical literature; biomedical text; extended semantic similarity-based method; feature coupling generalization; feature-based convolution tree; graph kernels; molecular mechanisms; protein names; protein-protein interaction extractor; text visualization; Convolution; Dictionaries; Electronics packaging; Feature extraction; Kernel; Protein engineering; Proteins; Feature Coupling Generalization; Information extraction; Multiple kernels learning; Protein-protein interaction;
Conference_Titel :
Bioinformatics and Biomedicine (BIBM), 2012 IEEE International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
978-1-4673-2559-2
Electronic_ISBN :
978-1-4673-2558-5
DOI :
10.1109/BIBM.2012.6392739