• DocumentCode
    498423
  • Title

    Sorting Genomes by Translocations and Deletions, Dealing with Duplicated Genes

  • Author

    Qi, Xingqin ; Li, Guojun

  • Author_Institution
    Sch. of Appl. Math. & Stat., Shandong Univ. at Weihai, Weihai, China
  • Volume
    2
  • fYear
    2009
  • fDate
    19-21 May 2009
  • Firstpage
    179
  • Lastpage
    183
  • Abstract
    The problem of sorting by reciprocal translocations (abbreviated as SBT) arises from the field of comparative genomics, which is to find a shortest sequence of reciprocal translocations that transforms one genome Pi into another genome Gamma, with the restriction that Pi and Gamma contain the same genes. SBT has been proved to be polynomial-time solvable, and several polynomial algorithms have been developed, including Bergeron´s SBT algorithm. Recently, an algorithm allowing to compare genomes with different genes but without duplications is given by extending Bergeron´s SBT algorithm. Here, we will improve the approach to handle genomes with duplicated genes. Obviously, the existence of duplications will make the problem more difficult. We should decide which one of the duplications in the subject genome Pi should be assigned to a particular homolog in the target genome Gamma; resolving such a problem appears to require exponential time. In this paper, by a similar method developed by Mark Marron et. al., which was used to sort two chromosomes containing different genes, we present an approximation algorithm for computing near minimal sorting sequences involving translocations and deletions between two (multi-chromosomes) genomes containing different genes allowing non-tails duplications. We derive the error bound for the polynomial-time distance computation. This is the first constant approximation algorithm for SBT with duplications.
  • Keywords
    approximation theory; biology computing; cellular biophysics; computational complexity; genetics; sorting; Bergeron SBT algorithm; approximation algorithm; chromosome sorting; comparative genomics; duplicated gene; exponential time; genome sorting; homolog; polynomial algorithm; polynomial-time distance computation; reciprocal translocation; shortest genome sequence; sorting-by-reversal; Algorithm design and analysis; Approximation algorithms; Bioinformatics; Biological cells; Genomics; Intelligent systems; Mathematics; Polynomials; Sorting; Statistics; deletions; duplications; translocations;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Systems, 2009. GCIS '09. WRI Global Congress on
  • Conference_Location
    Xiamen
  • Print_ISBN
    978-0-7695-3571-5
  • Type

    conf

  • DOI
    10.1109/GCIS.2009.51
  • Filename
    5209447