DocumentCode :
1147400
Title :
The number of recombination events in a sample history: conflict graph and lower bounds
Author :
Bafna, Vineet ; Bansal, Vikas
Volume :
1
Issue :
2
fYear :
2004
Firstpage :
78
Lastpage :
90
Abstract :
We consider the following problem: Given a set of binary sequences, determine lower bounds on the minimum number of recombinations required to explain the history of the sample, under the infinite-sites model of mutation. The problem has implications for finding recombination hotspots and for the Ancestral Recombination Graph reconstruction problem. Hudson and Kaplan gave a lower bound based on the four-gamete test. In practice, their bound Rm often greatly underestimates the minimum number of recombinations. The problem was recently revisited by Myers and Griffiths, who introduced two new lower bounds Rh and Rs which are provably better, and also yield good bounds in practice. However, the worst-case complexities of their procedures for computing Rh and Rs are exponential and super-exponential, respectively. In this paper, we show that the number of nontrivial connected components, Rc, in the conflict graph for a given set of sequences, computable in time 0(nm2), is also a lower bound on the minimum number of recombination events. We show that in many cases, Rc is a better bound than Rh. The conflict graph was used by Gusfield et al. to obtain a polynomial time algorithm for the galled tree problem, which is a special case of the Ancestral Recombination Graph (ARG) reconstruction problem. Our results also offer some insight into the structural properties of this graph and are of interest for the general Ancestral Recombination Graph reconstruction problem.
Keywords :
biology computing; evolution (biological); genetics; physiological models; trees (mathematics); ancestral recombination graph reconstruction problem; binary sequences; conflict graph; four-gamete test; galled tree problem; infinite-sites mutation model; lower bounds; polynomial time algorithm; recombination events; recombination hotspots; sample history; Binary sequences; Computational biology; Genetic mutations; History; Phylogeny; Polynomials; Shape; Testing; Tree graphs; Index Terms- Recombination; NP-completeness.; ancestral recombination graph; conflict graph; haplotypes; lower bounds; phylogenetic networks; Alcohol Dehydrogenase; Algorithms; Animals; Computational Biology; Drosophila melanogaster; Evolution, Molecular; Haplotypes; Models, Genetic; Mutation; Phylogeny; Recombination, Genetic;
fLanguage :
English
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
Publisher :
ieee
ISSN :
1545-5963
Type :
jour
DOI :
10.1109/TCBB.2004.23
Filename :
1350750
Link To Document :
بازگشت