DocumentCode
3519901
Title
Improvement of Survival Prediction from Gene Expression Profiles by Mining of Prior Knowledge
Author
Ren, Siyuan ; Obradovic, Zoran
Author_Institution
Center for Inf. Sci. & Technol., Temple Univ., Philadelphia, PA
fYear
2008
fDate
3-5 Nov. 2008
Firstpage
372
Lastpage
375
Abstract
Identification of a small set of discriminative genes is a crucial step for effective prediction of disease or patient survival based on microarray gene expression data. Previous approaches to this problem are mainly based on analyzing differential gene expression data. In this work, an additional step is introduced to take advantage of prior knowledge about the relation of genes and a disease. In the proposed approach, keyword scanning of human proteins at the Swissprot database is performed to select genes related to the disease of interest followed by analysis of differential gene expressions. In results obtained on lung cancer data where a differential expression-based selection of genes is fairly inaccurate, our prior knowledge mining based approach offered a large improvement of prediction accuracy (0.74 vs. 0.58 ROC curve when using 20 genes). Furthermore, experimental results on a breast cancer dataset, where prediction based on differential gene expression alone was quite accurate can be further improved by integrating with our new approach.
Keywords
cancer; data mining; database management systems; genetics; lung; medical computing; proteins; tumours; Swissprot database; breast cancer; differential expression-based selection; gene expression profile; gene identification; knowledge mining; lung cancer; microarray gene expression; survival prediction; Accuracy; Bioinformatics; Breast; Cancer; Data analysis; Diseases; Gene expression; Lungs; Neural networks; Testing; Feature selection; classification; gene expression analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Bioinformatics and Biomedicine, 2008. BIBM '08. IEEE International Conference on
Conference_Location
Philadelphia, PA
Print_ISBN
978-0-7695-3452-7
Type
conf
DOI
10.1109/BIBM.2008.53
Filename
4684922
Link To Document