Title :
Probability Theory-Based SNP Association Study Method for Identifying Susceptibility Loci and Genetic Disease Models in Human Case-Control Data
Author :
Yuan, Xiguo ; Zhang, Junying ; Wang, Yue
Author_Institution :
Sch. of Comput. Sci. & Eng., Xidian Univ., Xi´´an, China
Abstract :
One of the most challenging points in studying human common complex diseases is to search for both strong and weak susceptibility single-nucleotide polymorphisms (SNPs) and identify forms of genetic disease models. Currently, a number of methods have been proposed for this purpose. Many of them have not been validated through applications into various genome datasets, so their abilities are not clear in real practice. In this paper, we present a novel SNP association study method based on probability theory, called ProbSNP. The method firstly detects SNPs by evaluating their joint probabilities in combining with disease status and selects those with the lowest joint probabilities as susceptibility ones, and then identifies some forms of genetic disease models through testing multiple-locus interactions among the selected SNPs. The joint probabilities of combined SNPs are estimated by establishing Gaussian distribution probability density functions, in which the related parameters (i.e., mean value and standard deviation) are evaluated based on allele and haplotype frequencies. Finally, we test and validate the method using various genome datasets. We find that ProbSNP has shown remarkable success in the applications to both simulated genome data and real genome-wide data.
Keywords :
DNA; Gaussian distribution; bioinformatics; diseases; genetics; molecular biophysics; molecular configurations; probability; Gaussian distribution PDF; ProbSNP; SNP association study; genetic disease models; human case control data; human common complex diseases; multiple locus interactions; probability density functions; probability theory; real genome wide data; simulated genome data; single nucleotide polymorphisms; strong susceptibility SNP; susceptibility loci identification; weak susceptibility SNP; Bioinformatics; Biomarkers; Data models; Diseases; Gaussian distribution; Genomics; Probability; Association study; Gaussian distribution; SNPs; case-control; probability theory; Case-Control Studies; Genetic Association Studies; Genetic Loci; Genetic Predisposition to Disease; Genetic Testing; Humans; Models, Genetic; Models, Statistical; Polymorphism, Single Nucleotide; Probability Theory;
Journal_Title :
NanoBioscience, IEEE Transactions on
DOI :
10.1109/TNB.2010.2070805