DocumentCode
1147400
Title
The number of recombination events in a sample history: conflict graph and lower bounds
Author
Bafna, Vineet ; Bansal, Vikas
Volume
1
Issue
2
fYear
2004
Firstpage
78
Lastpage
90
Abstract
We consider the following problem: Given a set of binary sequences, determine lower bounds on the minimum number of recombinations required to explain the history of the sample, under the infinite-sites model of mutation. The problem has implications for finding recombination hotspots and for the Ancestral Recombination Graph reconstruction problem. Hudson and Kaplan gave a lower bound based on the four-gamete test. In practice, their bound Rm often greatly underestimates the minimum number of recombinations. The problem was recently revisited by Myers and Griffiths, who introduced two new lower bounds Rh and Rs which are provably better, and also yield good bounds in practice. However, the worst-case complexities of their procedures for computing Rh and Rs are exponential and super-exponential, respectively. In this paper, we show that the number of nontrivial connected components, Rc, in the conflict graph for a given set of sequences, computable in time 0(nm2), is also a lower bound on the minimum number of recombination events. We show that in many cases, Rc is a better bound than Rh. The conflict graph was used by Gusfield et al. to obtain a polynomial time algorithm for the galled tree problem, which is a special case of the Ancestral Recombination Graph (ARG) reconstruction problem. Our results also offer some insight into the structural properties of this graph and are of interest for the general Ancestral Recombination Graph reconstruction problem.
Keywords
biology computing; evolution (biological); genetics; physiological models; trees (mathematics); ancestral recombination graph reconstruction problem; binary sequences; conflict graph; four-gamete test; galled tree problem; infinite-sites mutation model; lower bounds; polynomial time algorithm; recombination events; recombination hotspots; sample history; Binary sequences; Computational biology; Genetic mutations; History; Phylogeny; Polynomials; Shape; Testing; Tree graphs; Index Terms- Recombination; NP-completeness.; ancestral recombination graph; conflict graph; haplotypes; lower bounds; phylogenetic networks; Alcohol Dehydrogenase; Algorithms; Animals; Computational Biology; Drosophila melanogaster; Evolution, Molecular; Haplotypes; Models, Genetic; Mutation; Phylogeny; Recombination, Genetic;
fLanguage
English
Journal_Title
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
Publisher
ieee
ISSN
1545-5963
Type
jour
DOI
10.1109/TCBB.2004.23
Filename
1350750
Link To Document