DocumentCode :
1987180
Title :
Haplotype motifs: an algorithmic approach to locating evolutionarily conserved patterns in haploid sequences
Author :
Schwartz, Russell
Author_Institution :
Dept. of Biol. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
fYear :
2003
fDate :
11-14 Aug. 2003
Firstpage :
306
Lastpage :
314
Abstract :
The promise of plentiful data on common human genetic variations has given hope that we will be able to uncover genetic factors behind common diseases that have proven difficult to locate by prior methods. Much recent interest in this problem has focused on using haplotypes (contiguous regions of correlated genetic variations), instead of the isolated variations, in order to reduce the size of the statistical analysis problem. In order to most effectively use such variation data, we will need a better understanding of haplotype structure, including both the general principles underlying haplotype structure in the human population and the specific structures found in particular genetic regions or sub-populations. This paper presents a probabilistic model for analyzing haplotype structure in a population using conserved motifs found in statistically significant sub-populations. It describes the model and computational methods for deriving the predicted motif set and haplotype structure for a population. It further presents results on simulated data, in order to validate the method, and on two real datasets from the literature, in order to illustrate its practical application.
Keywords :
biology computing; cellular biophysics; diseases; dynamic programming; genetic algorithms; genetics; physiological models; probability; statistical analysis; diseases; dynamic programming; expectation maximization; haploid sequences; haplotype motifs; human genetic variations; probabilistic model; real datasets; significance testing; simulated data; single nucleotide polymorphism; statistical analysis; Bioinformatics; Biology; Computational modeling; Diseases; Genetics; Genomics; Humans; Predictive models; Statistical analysis; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics Conference, 2003. CSB 2003. Proceedings of the 2003 IEEE
Print_ISBN :
0-7695-2000-6
Type :
conf
DOI :
10.1109/CSB.2003.1227331
Filename :
1227331
Link To Document :
بازگشت