Title :
Mining Gene Expression Database for Primary Human Disease Tissues
Author :
Campen, Andrew ; Xia, Yuni ; Rigsby, Dan ; Guo, Ying ; Feng, Xingdong ; Su, Eric W. ; Palakal, Mathew ; Li, Shuyu
Author_Institution :
Dept. of Comput. Sci., Indiana Univ., Indianapolis, IN
Abstract :
Studies of gene expression in primary human disease tissue often span several years in order to achieve reasonably large sample sizes and to collect patient clinical information making this data particularly valuable. Due to the lack of a central repository, this data has only been available through disparate and non-publicly accessible sources following publication. We developed disease-to-gene expression mapper (D-GEM) as a publically accessible database and data mining toolbox for microarray data of human primary disease tissue. A statistical pipeline has also been implemented to identify genes over-expressed in disease tissue samples in comparison with normal control samples, or genes whose expression values are associated with clinical parameters such as patient survival rate. One potential application of this data is the identification of pathway specific cancer prognosis markers. By applying a novel, gene signatures for cancer prognosis in the context of known biological pathways in cancer development were identified and confirmed.
Keywords :
biological tissues; cancer; data mining; genetics; medical diagnostic computing; cancer development; cancer prognosis; disease-to-gene expression mapper; gene expression database mining; gene signatures; human primary disease tissue; microarray data; statistical pipeline; Cancer; Clinical diagnosis; Data analysis; Data mining; Databases; Diseases; Gene expression; Humans; Pipelines; Testing;
Conference_Titel :
Data Engineering, 2008. ICDE 2008. IEEE 24th International Conference on
Conference_Location :
Cancun
Print_ISBN :
978-1-4244-1836-7
Electronic_ISBN :
978-1-4244-1837-4
DOI :
10.1109/ICDE.2008.4497632