DocumentCode :
79901
Title :
Statistical Detection of Intrinsically Multivariate Predictive Genes
Author :
Ting Chen ; Braga-Neto, Ulisses M.
Author_Institution :
Emmes Corp., Rockville, MD, USA
Volume :
12
Issue :
4
fYear :
2015
fDate :
July-Aug. 1 2015
Firstpage :
951
Lastpage :
964
Abstract :
Canalizing genes possess broad regulatory power over a wide swath of regulatory processes. On the other hand, it has been hypothesized that the phenomenon of intrinsically multivariate prediction (IMP) is associated with canalization. However, applications have relied on user-selectable thresholds on the IMP score to decide on the presence of IMP. A methodology is developed here that avoids arbitrary thresholds, by providing a statistical test for the IMP score. In addition, the proposed procedure allows the incorporation of prior knowledge if available, which can alleviate the problem of loss of power due to small sample sizes. The issue of multiplicity of tests is addressed by family-wise error rate (FWER) and false discovery rate (FDR) controlling approaches. The proposed methodology is demonstrated by experiments using synthetic and real gene-expression data from studies on melanoma and ionizing radiation (IR) responsive genes. The results with the real data identified DUSP1 and p53, two well-known canalizing genes associated with melanoma and IR response, respectively, as the genes with a clear majority of IMP predictor pairs. This validates the potential of the proposed methodology as a tool for discovery of canalizing genes from binary gene-expression data. The procedure is made available through an R package.
Keywords :
cancer; genetics; skin; statistical analysis; DUSP1; IMP predictor pairs; IMP score; R package; broad regulatory power; canalizing genes; false discovery rate controlling approaches; family-wise error rate; intrinsically multivariate predictive genes; ionizing radiation responsive genes; melanoma; p53; real gene-expression data; regulatory processes; statistical detection; statistical test; synthetic gene-expression data; user-selectable thresholds; Bioinformatics; Computational biology; IEEE transactions; Irrigation; Logic gates; Stochastic processes; Testing; Coefficient of Determination; Coefficient of determination; Intrinsically Multivariate Predictive Genes; Multiple Testing Procedure; Stochastic Logic Model; intrinsically multivariate predictive genes; multiple testing procedure; stochastic logic model;
fLanguage :
English
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
Publisher :
ieee
ISSN :
1545-5963
Type :
jour
DOI :
10.1109/TCBB.2014.2377731
Filename :
6977961
Link To Document :
بازگشت