DocumentCode
3714533
Title
HAsh-MaP-ERadicator: Filtering non-target sequences from next generation sequencing reads
Author
Jonathon Brenner;Catherine Putonti
Author_Institution
Department of Computer Science, Bioinformatics Program, Loyola University Chicago, IL 60660, USA
fYear
2015
Firstpage
1100
Lastpage
1101
Abstract
Contemporary DNA sequencing technologies are continuously increasing throughput at ever decreasing costs. Moreover, due to recent advances in sequencing technology new platforms are emerging. As such computational challenges persist. The average read length possible has taken a giant leap forward with the PacBio and Nanopore solutions. Regardless of the platform used, impurities within the DNA preparation of the sample - be it from unintentional contaminants or pervasive symbiots - remains an issue. We have developed a new tool, HAsh-MaP-ERadicator (HAMPER), for the detection and removal of non-target, contaminating DNA sequences. Integrating hash-based and mapping-based strategies, HAMPER is both memory and time efficient while maintaining a high level of sensitivity. Moreover, HAMPER was designed for flexibility: reads of any size can be efficiently examined and the user can set parameters specific for the analysis of reads produced by a particular sequencer. To evaluate our method, mock sequencing runs were generated including various contaminating species and with variable rates of mutation revealing a high level of sensitivity and specificity. Reads that are not of interest can quickly be removed using HAMPER thus improving downstream analyses.
Keywords
"Genomics","Bioinformatics","Sensitivity","Sequential analysis"
Publisher
ieee
Conference_Titel
Bioinformatics and Biomedicine (BIBM), 2015 IEEE International Conference on
Type
conf
DOI
10.1109/BIBM.2015.7359835
Filename
7359835
Link To Document