Title :
Empirical bayes model comparisons for differential methylation analysis
Author :
Teng, Mingxiang ; Wang, Yadong ; Liu, Yunlong ; Kim, Seongho ; Balch, Curt ; Nephew, Kenneth P. ; Li, Lang
Author_Institution :
Sch. of Comput. Sci. & Technol., Harbin Inst. of Technol., Harbin, China
Abstract :
A number of empirical Bayes models were developed to investigate the differential methylation analysis. However, it is not clear which empirical Bayes model performs best in differential methylation analysis. In this paper, five empirical Bayes models were constructed and applied to the differential methylation analysis of A2780 cells between before and after 1, 3, and 5 round of cisplatin treatment. The log-normal model with the background variance showed the lowest minimized negative log-likelihood. It inferred increasing number of differentially methylated loci from 1 to 3 to 5 rounds of cisplatin treatment on the A2780 cells, which was consistent to cisplatin resistant IC50 data. Among differentially methylated loci selected from each empirical model, three time dependent methylation patterns were defined: stochastic hypomethylation, stochastic hypermethylation, and random methylation. If the empirical Bayes model of the DNA methylation assumed log-normal distribution, both stochastically hypomethylated loci and stochastically hypermethylated loci were enriched with a number of transcription factor binding sites. Almost no TFBS enrichment was observed if the gamma distribution was assumed in the empirical Bayes model. In summary, by comparing the performances of the differential methylation analysis and the TFBS enrichment analysis, log-normal distribution is a better statistical assumption than the gamma distribution in the empirical Bayes model.
Keywords :
Bayes methods; DNA; biology computing; cellular biophysics; gamma distribution; genomics; log normal distribution; molecular biophysics; stochastic processes; cisplatin treatment; empirical Bayes model; gamma distribution; genomics; log-normal distribution; methylation analysis; random methylation; stochastic hypermethylation; stochastic hypomethylation; Analytical models; Bioinformatics; DNA; Data models; Genomics; Log-normal distribution; Stochastic processes; differential methylation analysis; empirical Bayes model; transcription factor binding site enrichment;
Conference_Titel :
Genomic Signal Processing and Statistics (GENSIPS), 2011 IEEE International Workshop on
Conference_Location :
San Antonio, TX
Print_ISBN :
978-1-4673-0491-7
Electronic_ISBN :
2150-3001
DOI :
10.1109/GENSiPS.2011.6169428