DocumentCode :
932362
Title :
TreeDT: tree pattern mining for gene mapping
Author :
Sevon, P. ; Toivonen, H. ; Ollikainen, V.
Author_Institution :
Dept. of Comput. Sci., Helsinki Univ.
Volume :
3
Issue :
2
fYear :
2006
Firstpage :
174
Lastpage :
185
Abstract :
We describe TreeDT, a novel association-based gene mapping method. Given a set of disease-associated haplotypes and a set of control haplotypes, TreeDT predicts likely locations of a disease susceptibility gene. TreeDT extracts, essentially in the form of haplotype trees, information about historical recombinations in the population: A haplotype tree constructed at a given chromosomal location is an estimate of the genealogy of the haplotypes. TreeDT constructs these trees for all locations on the given haplotypes and performs a novel disequilibrium test on each tree: Is there a small set of subtrees with relatively high proportions of disease-associated chromosomes, suggesting shared genetic history for those and a likely disease gene location? We give a detailed description of TreeDT and the tree disequilibrium tests, we analyze the algorithm formally, and we evaluate its performance experimentally on both simulated and real data sets. Experimental results demonstrate that TreeDT has high accuracy on difficult mapping tasks and comparisons to other methods (EATDT, HPM, TDT) show that TreeDT is very competitive
Keywords :
cellular biophysics; data mining; diseases; genetics; medical diagnostic computing; molecular biophysics; trees (mathematics); TreeDT; association-based gene mapping; disease gene location; disease susceptibility gene; disease-associated chromosomes; disease-associated haplotypes; genealogy; haplotype tree; historical recombinations; shared genetic history; tree disequilibrium tests; tree pattern mining; Algorithm design and analysis; Biological cells; Chromosome mapping; Data mining; Diseases; Genetics; History; Performance analysis; Performance evaluation; Testing; Biology and genetics; nonnumerical algorithms and problems.; nonparametric statistics; Algorithms; Chromosome Mapping; Computational Biology; Computer Simulation; Diabetes Mellitus, Type 1; Genetic Predisposition to Disease; Haplotypes; Humans; Linkage Disequilibrium; Pedigree; Recombination, Genetic; Statistics, Nonparametric;
fLanguage :
English
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
Publisher :
ieee
ISSN :
1545-5963
Type :
jour
DOI :
10.1109/TCBB.2006.28
Filename :
1631998
Link To Document :
بازگشت