• DocumentCode
    3714455
  • Title

    Training word embeddings for deep learning in biomedical text mining tasks

  • Author

    Zhenchao Jiang;Lishuang Li;Degen Huang; Liuke Jin

  • Author_Institution
    School of Computer Science and Technology, Dalian University of Technology, China
  • fYear
    2015
  • Firstpage
    625
  • Lastpage
    628
  • Abstract
    Most word embedding methods are proposed with general purpose which take a word as a basic unit and learn embeddings according to words´ external contexts. However, in biomedical text mining, there are many biomedical entities and syntactic chunks which contain rich domain information, and the semantic meaning of a word is also strongly related to those information. Hence, we present a biomedical domain-specific word embedding model by incorporating stem, chunk and entity to train word embeddings. We also present two deep learning architectures respectively for two biomedical text mining tasks, by which we evaluate our word embeddings and compare them with other models. Experimental results show that our biomedical domain-specific word embeddings overall outperform other general-purpose word embeddings in these deep learning methods for biomedical text mining tasks.
  • Keywords
    "Syntactics","Training","Proteins","Protein engineering"
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Biomedicine (BIBM), 2015 IEEE International Conference on
  • Type

    conf

  • DOI
    10.1109/BIBM.2015.7359756
  • Filename
    7359756