• DocumentCode
    2737018
  • Title

    Poster: ViSpA: Viral spectrum assembling method

  • Author

    Astrovskaya, Irina ; Tork, Bassam ; Mangul, Serghei ; Westbrooks, Kelly ; Mandoiu, Ion ; Balfe, Peter ; Zelikovsky, Alex

  • Author_Institution
    Dept. of Comput. Sci., Georgia State Univ., Atlanta, GA, USA
  • fYear
    2011
  • fDate
    3-5 Feb. 2011
  • Firstpage
    234
  • Lastpage
    234
  • Abstract
    Like many RNA viruses, Hepatitis C virus (HCV) exists as a set of closely related sequences (quasispecies). The diversity of the quasispecies sequences can explain vaccines failures and virus resistance to existing therapies. Since the original software of next-generation sequencing systems assumes a single genome, there is a need for a new assembler that infers viral population in a host. Thus, the paper focuses on Quasispecies Spectrum Reconstruction (QSR) Problem: given a collection of 454 pyrosequencing reads taken from a sample quasispecies population, reconstruct the quasispecies spectrum, i.e., the set of sequences and the relative frequency of each sequence in the sample population.This poster introduces the ViSpA method that significantly extends previous approach by handling contaminated reads and overlaps with partial agreement between reads, by assembling haplotypes from per-vertex max-bandwidth paths via mutation-based clustering, and by estimating assemblies´ frequencies via EM. A procedure to fix systematic 454 errors in homopolymers if they happen in the coding region is suggested.
  • Keywords
    diseases; macromolecules; medical computing; microorganisms; molecular biophysics; molecular configurations; polymers; RNA virus; ViSpA; closely related sequences; coding region; haplotypes; hepatitis C virus; homopolymers; mutation-based clustering; next-generation sequencing systems; per-vertex max-bandwidth paths; pyrosequencing; quasispecies; quasispecies spectrum reconstruction; vaccines failures; viral population; viral spectrum assembling method; virus resistance; Assembly; Computer science; Electronic mail; Frequency estimation; Proteins; Software; Systematics; Next-generation sequencing; expectation maximization; viral assembling;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Advances in Bio and Medical Sciences (ICCABS), 2011 IEEE 1st International Conference on
  • Conference_Location
    Orlando, FL
  • Print_ISBN
    978-1-61284-851-8
  • Type

    conf

  • DOI
    10.1109/ICCABS.2011.5729888
  • Filename
    5729888