DocumentCode :
79504
Title :
Parametric Estimation of the Local False Discovery Rate for Identifying Genetic Associations
Author :
Ye Yang ; Aghababazadeh, Farnoosh Abbas ; Bickel, David R.
Author_Institution :
Bank of Nova Scotia (Scotiabank), Toronto, ON, Canada
Volume :
10
Issue :
1
fYear :
2013
fDate :
Jan.-Feb. 2013
Firstpage :
98
Lastpage :
108
Abstract :
Many genome-wide association studies have been conducted to identify single nucleotide polymorphisms (SNPs) that are associated with particular diseases or other traits. The local false discovery rate (LFDR) estimated using semiparametric models has enjoyed success in simultaneous inference. However, semiparametric LFDR estimators can be biased because they tend to overestimate the proportion of the nonassociated SNPs. We address the problem by adapting a simple parametric mixture model (PMM) and by comparing this model to the semiparametric mixture model (SMM) behind an LFDR estimator that is known to be conservatively biased. Then, we also compare the PMM with a parametric nonmixture model (PNM). In our simulation studies, we thoroughly analyze the performances of the three models under different values of p1, a prior probability that is approximately equal to the proportion of SNPs that are associated with the disease. When p1 > 10%, the PMM generally performs better than the SMM. When p1 <; 0:1%, the SMM outperforms PMM. When p1 lies between 0.1 and 10 percent, both methods have about the same performance. In that setting, the PMM may be preferred since it has the advantage of supplying an estimate of the detectability level of the nonassociated SNPs.
Keywords :
bioinformatics; diseases; genomics; polymorphism; LFDR estimation; disease; genetic association identification; genome wide association; local false discovery rate; parametric estimation; parametric nonmixture model; semiparametric mixture model; semiparametric model; simple parametric mixture model; single nucleotide polymorphism; Adaptation models; Analytical models; Bioinformatics; Diseases; Estimation; Solid modeling; Standards; Empirical Bayes; MDL; Type II maximum likelihood; genome-wide association studies; local false discovery rate; minimum description length; reduced likelihood; strength of statistical evidence; Bayes Theorem; Computational Biology; Computer Simulation; Genome-Wide Association Study; Humans; Models, Genetic; Models, Statistical; Polymorphism, Single Nucleotide;
fLanguage :
English
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
Publisher :
ieee
ISSN :
1545-5963
Type :
jour
DOI :
10.1109/TCBB.2012.140
Filename :
6365171
Link To Document :
بازگشت