• DocumentCode
    1147400
  • Title

    The number of recombination events in a sample history: conflict graph and lower bounds

  • Author

    Bafna, Vineet ; Bansal, Vikas

  • Volume
    1
  • Issue
    2
  • fYear
    2004
  • Firstpage
    78
  • Lastpage
    90
  • Abstract
    We consider the following problem: Given a set of binary sequences, determine lower bounds on the minimum number of recombinations required to explain the history of the sample, under the infinite-sites model of mutation. The problem has implications for finding recombination hotspots and for the Ancestral Recombination Graph reconstruction problem. Hudson and Kaplan gave a lower bound based on the four-gamete test. In practice, their bound Rm often greatly underestimates the minimum number of recombinations. The problem was recently revisited by Myers and Griffiths, who introduced two new lower bounds Rh and Rs which are provably better, and also yield good bounds in practice. However, the worst-case complexities of their procedures for computing Rh and Rs are exponential and super-exponential, respectively. In this paper, we show that the number of nontrivial connected components, Rc, in the conflict graph for a given set of sequences, computable in time 0(nm2), is also a lower bound on the minimum number of recombination events. We show that in many cases, Rc is a better bound than Rh. The conflict graph was used by Gusfield et al. to obtain a polynomial time algorithm for the galled tree problem, which is a special case of the Ancestral Recombination Graph (ARG) reconstruction problem. Our results also offer some insight into the structural properties of this graph and are of interest for the general Ancestral Recombination Graph reconstruction problem.
  • Keywords
    biology computing; evolution (biological); genetics; physiological models; trees (mathematics); ancestral recombination graph reconstruction problem; binary sequences; conflict graph; four-gamete test; galled tree problem; infinite-sites mutation model; lower bounds; polynomial time algorithm; recombination events; recombination hotspots; sample history; Binary sequences; Computational biology; Genetic mutations; History; Phylogeny; Polynomials; Shape; Testing; Tree graphs; Index Terms- Recombination; NP-completeness.; ancestral recombination graph; conflict graph; haplotypes; lower bounds; phylogenetic networks; Alcohol Dehydrogenase; Algorithms; Animals; Computational Biology; Drosophila melanogaster; Evolution, Molecular; Haplotypes; Models, Genetic; Mutation; Phylogeny; Recombination, Genetic;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2004.23
  • Filename
    1350750