• DocumentCode
    3640042
  • Title

    PSI-RA: A parallel sparse index for read alignment on genomes

  • Author

    M. Oguzhan Külekci;Wing-Kai Hon;Rahul Shah;Jeffrey Scott Vitter;Bojian Xu

  • Author_Institution
    National Research Institute of Electronics &
  • fYear
    2010
  • Firstpage
    663
  • Lastpage
    668
  • Abstract
    We concentrate on indexing DNA sequences via sparse suffix arrays (SSAs) and propose a new short read aligner named PSI-RA (parallel sparse index read aligner). The motivation for using SSAs is the ability to trade memory against time: the space consumption of the index can be tuned to the available memory of the machine and the minimum length of the arriving pattern queries. Although SSAs have been studied before for exact matching of short reads, an elegant approach to approximate matching was missing. We provide this by defining the right-most mismatch criterion, which prioritizes errors towards the end of the reads, since errors are known to be more probable in that region. PSI-RA supports any number of mismatches in aligning reads. We give comparisons with some of the well-known short read aligners, and show that indexing a genome with an SSA is a good alternative to Burrows-Wheeler transform or seed-based solutions.
  • Keywords
    "Genomics","Arrays","Indexing","DNA","Complexity theory","Humans"
  • Publisher
    IEEE
  • Conference_Titel
    Bioinformatics and Biomedicine (BIBM), 2010 IEEE International Conference on
  • Print_ISBN
    978-1-4244-8306-8
  • Type

    conf

  • DOI
    10.1109/BIBM.2010.5706648
  • Filename
    5706648
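
The memory/time trade-off mentioned in the abstract can be illustrated with a minimal Python sketch of a sparse suffix array used for exact matching. This is not the PSI-RA implementation: the function names (`build_ssa`, `find_exact`), the sampling rule (index every `k`-th suffix), and the verification step are assumptions chosen for illustration, and the right-most mismatch criterion is not modelled here. A larger `k` stores fewer suffixes but requires up to `k` searches per read, and only reads of length at least `k` are guaranteed to be found.

```python
def build_ssa(text, k):
    """Sorted start positions of every k-th suffix of text (hypothetical helper)."""
    return sorted(range(0, len(text), k), key=lambda i: text[i:])


def _bound(text, ssa, tail, upper):
    """Binary search over sampled suffixes, comparing only their first len(tail) chars.

    upper=False: first index whose prefix is >= tail.
    upper=True:  first index whose prefix is >  tail.
    """
    lo, hi = 0, len(ssa)
    while lo < hi:
        mid = (lo + hi) // 2
        prefix = text[ssa[mid]:ssa[mid] + len(tail)]
        if prefix < tail or (upper and prefix == tail):
            lo = mid + 1
        else:
            hi = mid
    return lo


def find_exact(text, ssa, k, pattern):
    """Return sorted start positions of all exact occurrences of pattern.

    Assumes len(pattern) >= k so that every occurrence covers a sampled position.
    """
    if len(pattern) < k:
        raise ValueError("pattern must be at least k characters long")
    hits = set()
    for j in range(k):
        # Skip the first j characters so the rest can align with a sampled suffix.
        tail = pattern[j:]
        lo = _bound(text, ssa, tail, upper=False)
        hi = _bound(text, ssa, tail, upper=True)
        for p in ssa[lo:hi]:
            start = p - j
            # Verify the skipped characters directly against the text.
            if start >= 0 and text[start:p] == pattern[:j]:
                hits.add(start)
    return sorted(hits)


if __name__ == "__main__":
    genome = "ACGTACGTTACG"
    ssa = build_ssa(genome, 3)
    print(find_exact(genome, ssa, 3, "ACG"))
```

With `k = 3` the index holds only four of the twelve suffixes of the toy string, at the cost of up to three binary searches per query.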