DocumentCode
3714455
Title
Training word embeddings for deep learning in biomedical text mining tasks
Author
Zhenchao Jiang;Lishuang Li;Degen Huang; Liuke Jin
Author_Institution
School of Computer Science and Technology, Dalian University of Technology, China
fYear
2015
Firstpage
625
Lastpage
628
Abstract
Most word embedding methods are proposed with general purpose which take a word as a basic unit and learn embeddings according to words´ external contexts. However, in biomedical text mining, there are many biomedical entities and syntactic chunks which contain rich domain information, and the semantic meaning of a word is also strongly related to those information. Hence, we present a biomedical domain-specific word embedding model by incorporating stem, chunk and entity to train word embeddings. We also present two deep learning architectures respectively for two biomedical text mining tasks, by which we evaluate our word embeddings and compare them with other models. Experimental results show that our biomedical domain-specific word embeddings overall outperform other general-purpose word embeddings in these deep learning methods for biomedical text mining tasks.
Keywords
"Syntactics","Training","Proteins","Protein engineering"
Publisher
ieee
Conference_Titel
Bioinformatics and Biomedicine (BIBM), 2015 IEEE International Conference on
Type
conf
DOI
10.1109/BIBM.2015.7359756
Filename
7359756
Link To Document