DocumentCode
983782
Title
Literature extraction of protein functions using sentence pattern mining
Author
Chiang, Jung-Hsien ; Yu, Hsu-Chun
Author_Institution
Dept. of Comput. Sci. & Inf. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan
Volume
17
Issue
8
fYear
2005
Firstpage
1088
Lastpage
1098
Abstract
With the rapid growth in the number of genomics research articles, it has become a challenge for biomedical researchers to keep up with this ever-increasing body of information and to learn of newly discovered functions of the proteins they study. Text mining has thus become essential for facilitating the functional annotation of proteins: it exploits the vast biomedical literature and transforms the knowledge it contains into easily accessible database formats. In this paper, we propose a sentence pattern mining method to extract protein functions from biomedical literature. To recognize variants of function terms correctly, we identify their morphological, syntactic, and semantic variation forms. The proposed methods can aid database curators in annotating protein functions and assist biologists and medical researchers in searching for protein functions in the biomedical literature.
Keywords
biology computing; computational linguistics; data mining; genetics; medical information systems; proteins; text analysis; bioinformatics; biomedical literature; genomics; knowledge acquisition; linguistic processing; protein functions; sentence pattern mining; text mining technique; databases; diseases; ontologies; organisms; text mining
fLanguage
English
Journal_Title
Knowledge and Data Engineering, IEEE Transactions on
Publisher
IEEE
ISSN
1041-4347
Type
jour
DOI
10.1109/TKDE.2005.132
Filename
1458702