• DocumentCode
    399587
  • Title

    Extracting biochemical interactions from MEDLINE using a link grammar parser

  • Author

    Ding, Jing ; Berleant, Daniel ; Xu, Jun ; Fulmer, Andy W.

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Iowa State Univ., Ames, IA, USA
  • fYear
    2003
  • fDate
    3-5 Nov. 2003
  • Firstpage
    467
  • Lastpage
    471
  • Abstract
    Many natural language processing approaches at various complexity levels have been reported for extracting biochemical interactions from MEDLINE. While some algorithms using simple template matching are unable to deal with the complex syntactic structures, others exploiting sophisticated parsing techniques are hindered by greater computational cost. This study investigates link grammar parsing for extracting biochemical interactions. Link grammar parsing can handle many syntactic structures and is computationally relatively efficient. We experimented on a sample MEDLINE corpus. Although the parser was originally developed for conversational English and made many mistakes in parsing sentences from the biochemical domain, it nevertheless achieved better overall performance than a co-occurrence-only method. Customizing the parser for the biomedical domain is expected to improve its performance further.
  • Keywords
    biochemistry; computational linguistics; data mining; grammars; medical information systems; natural languages; MEDLINE; biochemical interaction; biomedical domain; computational cost; link grammar parsing; natural language processing; syntactic structure; template matching; Abstracts; Amino acids; Biomedical computing; Computational complexity; Computational efficiency; Databases; Humans; Insulin; Natural language processing; Proteins;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Tools with Artificial Intelligence, 2003. Proceedings. 15th IEEE International Conference on
  • ISSN
    1082-3409
  • Print_ISBN
    0-7695-2038-3
  • Type

    conf

  • DOI
    10.1109/TAI.2003.1250226
  • Filename
    1250226