• DocumentCode
    1180219
  • Title

    New Methods for Inference of Local Tree Topologies with Recombinant SNP Sequences in Populations

  • Author

    Wu, Yufeng

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Univ. of Connecticut, Storrs, CT, USA
  • Volume
    8
  • Issue
    1
  • fYear
    2011
  • Firstpage
    182
  • Lastpage
    193
  • Abstract
    Large amount of population-scale genetic variation data are being collected in populations. One potentially important biological problem is to infer the population genealogical history from these genetic variation data. Partly due to recombination, genealogical history of a set of DNA sequences in a population usually cannot be represented by a single tree. Instead, genealogy is better represented by a genealogical network, which is a compact representation of a set of correlated local genealogical trees, each for a short region of genome and possibly with different topology. Inference of genealogical history for a set of DNA sequences under recombination has many potential applications, including association mapping of complex diseases. In this paper, we present two new methods for reconstructing local tree topologies with the presence of recombination, which extend and improve the previous work in. We first show that the "tree scan” method can be converted to a probabilistic inference method based on a hidden Markov model. We then focus on developing a novel local tree inference method called RENT that is both accurate and scalable to larger data. Through simulation, we demonstrate the usefulness of our methods by showing that the hidden-Markov-model-based method is comparable with the original method in terms of accuracy. We also show that RENT is competitive with other methods in terms of inference accuracy, and its inference error rate is often lower and can handle large data.
  • Keywords
    DNA; bioinformatics; genomics; hidden Markov models; DNA sequence; RENT method; genome; hidden Markov model; local tree topology; population genealogical history; population scale genetic variation; recombinant SNP sequence; tree scan method; Bioinformatics; DNA; Diseases; Genetic mutations; Genomics; Hidden Markov models; History; Network topology; Sequences; Tree graphs; Population genetics; algorithm; ancestral recombination graph; hidden Markov model.; recombination; Algorithms; Computer Simulation; Genetics, Population; Genomics; Humans; Markov Chains; Pedigree; Phylogeny; Polymorphism, Single Nucleotide; Recombination, Genetic; Sequence Analysis, DNA;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2009.27
  • Filename
    4796189