• DocumentCode
    983782
  • Title

    Literature extraction of protein functions using sentence pattern mining

  • Author

    Chiang, Jung-Hsien ; Yu, Hsu-Chun

  • Author_Institution
    Dept. of Comput. Sci. & Inf. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan
  • Volume
    17
  • Issue
    8
  • fYear
    2005
  • Firstpage
    1088
  • Lastpage
    1098
  • Abstract
    With the rapid growth of articles of genomics research, it has become a challenge for biomedical researchers to access this ever-increasing quantity of information to understand the newest discovery of functions of proteins they are studying. To facilitate functional annotation of proteins by utilizing the huge amounts of biomedical literature and transforming the knowledge into easily accessible database formats, the text mining technique thus becomes essential. In this paper, we propose the method of sentence pattern mining to extract protein functions from biomedical literature. To recognize variants of function terms correctly, we identify morphological, syntactic, and semantic variation forms. The proposed methods can be used to aid database curators in annotating protein functions and to assist biologists and medical researchers in searching protein functions from biomedical literature.
  • Keywords
    biology computing; computational linguistics; data mining; genetics; medical information systems; proteins; text analysis; bioinformatics; biomedical literature; genomics; knowledge acquisition; linguistic processing; protein functions; sentence pattern mining; text mining technique; Bioinformatics; Data mining; Databases; Diseases; Genomics; Knowledge acquisition; Ontologies; Organisms; Proteins; Text mining; Index Terms- Text mining; bioinformatics; knowledge acquisition; linguistic processing.;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2005.132
  • Filename
    1458702