• DocumentCode
    673195
  • Title

    Genalign — A high performance implementation for aligning the compressed DNA sequences

  • Author

    Satyanvesh, D. ; Balleda, Kaliuday ; Baruah, P.K.

  • Author_Institution
    Sri Sathya Sai Inst. of Higher Learning, Prasanthi Nilayam, India
  • fYear
    2013
  • fDate
    21-22 Sept. 2013
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    In molecular biology, sequence alignment is a way of arranging DNA, RNA, or protein sequences to identify regions of similarity between them. This is a challenging problem because DNA sequences are huge and the databases are growing at an exponential rate, so alignment requires a tremendous amount of memory and computational power. For example, the human genome in raw format ranges from 2 to 30 terabytes. DNA inherently contains many repeats, which makes it highly compressible. This paper presents a new approach that aligns the sequences after compressing them. The alignment consists of both ungapped and gapped alignment. Multi-cores and GPUs can be used to align these huge sequences quickly in their compressed form. The main focus is on aligning the huge sequences accurately. The ungapped alignment achieves a speedup of up to 56 on K20 Kepler GPUs, and the gapped alignment achieves a speedup of up to 15 on multi-cores.
  • Keywords
    DNA; RNA; bioinformatics; graphics processing units; molecular biophysics; molecular configurations; multiprocessing systems; parallel processing; proteins; GenAlign; K20 Kepler GPU; RNA sequences; compressed DNA sequences; gapped alignment; high performance implementation; molecular biology; multicore system; protein sequences; sequence alignment; ungapped alignment; Bioinformatics; DNA; Dynamic programming; Graphics processing units; Heuristic algorithms; Instruction sets; Random access memory; Bandwidth; Code byte; Compression Ratio; DNA Sequence; Gapped alignment; Speedup; Throughput; Ungapped alignment;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advanced Computing Technologies (ICACT), 2013 15th International Conference on
  • Conference_Location
    Rajampet
  • Print_ISBN
    978-1-4673-2816-6
  • Type

    conf

  • DOI
    10.1109/ICACT.2013.6710490
  • Filename
    6710490