• DocumentCode
    3714533
  • Title

    HAsh-MaP-ERadicator: Filtering non-target sequences from next generation sequencing reads

  • Author

    Jonathon Brenner;Catherine Putonti

  • Author_Institution
    Department of Computer Science, Bioinformatics Program, Loyola University Chicago, IL 60660, USA
  • fYear
    2015
  • Firstpage
    1100
  • Lastpage
    1101
  • Abstract
    Contemporary DNA sequencing technologies are continuously increasing throughput at ever decreasing costs. Moreover, due to recent advances in sequencing technology new platforms are emerging. As such computational challenges persist. The average read length possible has taken a giant leap forward with the PacBio and Nanopore solutions. Regardless of the platform used, impurities within the DNA preparation of the sample - be it from unintentional contaminants or pervasive symbiots - remains an issue. We have developed a new tool, HAsh-MaP-ERadicator (HAMPER), for the detection and removal of non-target, contaminating DNA sequences. Integrating hash-based and mapping-based strategies, HAMPER is both memory and time efficient while maintaining a high level of sensitivity. Moreover, HAMPER was designed for flexibility: reads of any size can be efficiently examined and the user can set parameters specific for the analysis of reads produced by a particular sequencer. To evaluate our method, mock sequencing runs were generated including various contaminating species and with variable rates of mutation revealing a high level of sensitivity and specificity. Reads that are not of interest can quickly be removed using HAMPER thus improving downstream analyses.
  • Keywords
    "Genomics","Bioinformatics","Sensitivity","Sequential analysis"
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Biomedicine (BIBM), 2015 IEEE International Conference on
  • Type

    conf

  • DOI
    10.1109/BIBM.2015.7359835
  • Filename
    7359835