DocumentCode :
3428216
Title :
Extraction of coexpression relationship among genes from biomedical text using dynamic conditional random fields
Author :
Tiwari, Richa ; Zhang, Chengcui ; Chen, Wei-Bang
Author_Institution :
Dept. of Comput. & Inf. Sci., Univ. of Alabama at Birmingham, Birmingham, AL, USA
fYear :
2009
fDate :
2-5 Aug. 2009
Firstpage :
1
Lastpage :
4
Abstract :
Text mining tools and algorithms are being successfully used for information extraction especially on large corpus like biomedical publications. These tools not only aid in information extraction but also in forming new theories and relationships between various fields of biomedical research. Extraction of gene-gene or gene-disease relationship is one such application. In this paper, we introduce a method to detect coexpressed genes from text, using the grammatical dependencies among the words within sentences and Dynamic Conditional Random Fields (DCRFs). Determining the coexpression relationship between and among genes can help in identifying important concepts such as the functionality of gene(s) involved, their pathogenic mechanism, and in deciphering protein-protein interactions. This work attempts to extract relevant sentences by labeling the genes involved as well as the word representing the relationship, from full-length papers collected from PubMed. The results obtained were compared with that of Support Vector Machine (SVM) and Nearest Neighbor with generalization (NNge), and have been found to outperform both.
Keywords :
data mining; genetics; grammars; information retrieval; medical computing; random processes; text analysis; PubMed; biomedical text mining; coexpression relationship information extraction; dynamic conditional random field; gene-disease relationship; grammatical dependency; large corpus; nearest neighbor; pathogenic mechanism; protein-protein interaction; support vector machine; Biomedical computing; Data mining; Graphical models; Hidden Markov models; Nearest neighbor searches; Proteins; Support vector machine classification; Support vector machines; Testing; Text mining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer-Based Medical Systems, 2009. CBMS 2009. 22nd IEEE International Symposium on
Conference_Location :
Albuquerque, NM
ISSN :
1063-7125
Print_ISBN :
978-1-4244-4879-1
Electronic_ISBN :
1063-7125
Type :
conf
DOI :
10.1109/CBMS.2009.5255388
Filename :
5255388
Link To Document :
بازگشت