DocumentCode
3640042
Title
PSI-RA: A parallel sparse index for read alignment on genomes
Author
M. Oguzhan Külekci;Wing-Kai Hon;Rahul Shah;Jeffrey Scott Vitter;Bojian Xu
Author_Institution
National Research Institute of Electronics &
fYear
2010
Firstpage
663
Lastpage
668
Abstract
We concentrate on indexing DNA sequences via sparse suffix arrays (SSAs) and propose a new short read aligner named PSI-RA (parallel sparse index read aligner). The motivation in using SSAs is the ability to trade memory against time. It is possible to tune the space consumption of the index based on the available memory of the machine and the minimum length of the arriving pattern queries. Although SSAs have been studied before on exact matching of short reads, an elegant way of approximate matching capability was missing. We provide this by defining the right-most mismatch criteria that prioritizes the errors towards the end of the reads since it is known that the errors are more probable at that area. PSI-RA supports any number of mismatches in aligning reads. We give comparisons with some of the well known short read aligners, and show that indexing genome with SSA is a good alternative to Burrows-Wheeler transform or seed based solutions.
Keywords
"Genomics","Arrays","Indexing","DNA","Complexity theory","Humans"
Publisher
ieee
Conference_Titel
Bioinformatics and Biomedicine (BIBM), 2010 IEEE International Conference on
Print_ISBN
978-1-4244-8306-8
Type
conf
DOI
10.1109/BIBM.2010.5706648
Filename
5706648
Link To Document