• DocumentCode
    2460446
  • Title

    Evolutional Dependency Parse Trees for Biological Relation Extraction

  • Author

    Kao, Hung-Yu ; Tang, Yi-Tsung ; Wang, Jian-Fu

  • Author_Institution
    Dept. of Comput. Sci. & Inf. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan
  • fYear
    2011
  • fDate
    24-26 Oct. 2011
  • Firstpage
    167
  • Lastpage
    174
  • Abstract
    Due to the rapid growth in biological technology, the development of high-quality information extraction systems is needed and still remains a challenge. Several recently proposed approaches to biological relation extraction are based on machine learning techniques on lexical and syntactic information. Most use the dependency path between two genes/proteins instead of the whole dependency tree of a sentence for identifying relationships. However, the dependency path may not have any node between two entities. If a limited set of annotated training corpora is used for the construction of tree information of biological relationships, the training corpus will lack some sentence structures and cannot predict whether the sentence has a biological relationship. In this paper, we developed a biological relation extraction system called Evolutional Tree Extraction System - ETree. We extended the dependency path to the dependency subtree and developed a method that can automatically expand and prune these existing dependency subtrees into various dependency subtrees. These dependency subtrees are called "Evolutional Trees" and are used to predict the biological relationship sentences.
  • Keywords
    biochemistry; biological techniques; biology computing; genetics; information technology; learning (artificial intelligence); molecular biophysics; proteins; annotated training corpora; biological relation extraction system; biological relationship sentences; biological technology; evolutional dependency parse trees; evolutional tree extraction system; genes; high-quality information extraction systems; lexical information; machine learning techniques; proteins; syntactic information; training corpus; tree information; Data mining; Feature extraction; Kernel; Proteins; Syntactics; Training; gene regulation; relation extraction; text mining;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Bioengineering (BIBE), 2011 IEEE 11th International Conference on
  • Conference_Location
    Taichung
  • Print_ISBN
    978-1-61284-975-1
  • Type

    conf

  • DOI
    10.1109/BIBE.2011.33
  • Filename
    6089824