DocumentCode :
1784912
Title :
A general instance representation architecture for protein-protein interaction extraction
Author :
Lishuang Li ; Zhenchao Jiang ; Degen Huang
Author_Institution :
Sch. of Comput. Sci. & Technol., Dalian Univ. of Technol., Dalian, China
fYear :
2014
fDate :
2-5 Nov. 2014
Firstpage :
497
Lastpage :
500
Abstract :
Previous researches have shown that supervised Protein-Protein Interaction Extraction (PPIE) can get high accuracies with elaborately selected features and kernels. However, most features and kernels rest upon domain knowledge and natural language analysis, which makes the supervised model expensive, heavy and brittle. Moreover, the one-hot encoding, a commonly used representation technique, fails to capture the semantic similarity between words. To reduce the manual labor and overcome the shortage of one-hot encoding, we put forward a general instance representation architecture for PPIE, which integrates word representation and vector composition. Our method obtains F-scores of 69.4%, 78.8%, 76.0%, 74.0% and 81.1% on AIMed, BioInfer, HPRD50, IEPA and LLL respectively.
Keywords :
bioinformatics; feature selection; learning (artificial intelligence); natural languages; proteins; proteomics; semantic networks; domain knowledge analysis; feature selection; general instance representation architecture; kernel selection; natural language analysis; semantic similarity; supervised protein-protein interaction extraction; vector composition integration; word representation integration; Encoding; Feature extraction; Kernel; Protein engineering; Proteins; Skeleton; Vectors; Protein-Protein Interaction; instance representation; relation extraction; word representation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Biomedicine (BIBM), 2014 IEEE International Conference on
Conference_Location :
Belfast
Type :
conf
DOI :
10.1109/BIBM.2014.6999208
Filename :
6999208
Link To Document :
بازگشت