DocumentCode :
673195
Title :
Genalign — A high performance implementation for aligning the compressed DNA sequences
Author :
Satyanvesh, D. ; Balleda, Kaliuday ; Baruah, P.K.
Author_Institution :
Sri Sathya Sai Inst. of Higher Learning, Prasanthi Nilayam, India
fYear :
2013
fDate :
21-22 Sept. 2013
Firstpage :
1
Lastpage :
6
Abstract :
In molecular biology, sequence alignment is a way of arranging DNA, RNA or protein sequences to identify regions of similarity between the sequences. However, this is a challenging problem since the DNA sequences are huge in size and the databases are growing at an exponential rate. It requires tremendous amount of memory and large computational power. For example, the human genome in raw format ranges from 2 to 30 Tera-bytes. The inherent property of DNA is that it contains many repeats which makes it highly compressible. This paper presents a new approach of aligning the sequences after compressing them. The alignment consists of both ungapped and gapped alignment. Multi-cores and GPUs can be used to align these huge sequences quickly on the compressed sequences. The focus mainly is on aligning the huge sequences accurately. The ungapped alignment achieves a speedup of upto 56 on K20 Kepler GPUs and the gapped alignment achieves a speedup of upto 15 on multi-cores.
Keywords :
DNA; RNA; bioinformatics; graphics processing units; molecular biophysics; molecular configurations; multiprocessing systems; parallel processing; proteins; GenAlign; K20 Kepler GPU; RNA sequences; compressed DNA sequences; gapped alignment; high performance implementation; molecular biology; multicore system; protein sequences; sequence alignment; ungapped alignment; Bioinformatics; DNA; Dynamic programming; Graphics processing units; Heuristic algorithms; Instruction sets; Random access memory; Bandwidth; Code byte; Compression Ratio; DNA Sequence; Gapped align-ment; Speedup; Throughput; Ungapped alignment;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advanced Computing Technologies (ICACT), 2013 15th International Conference on
Conference_Location :
Rajampet
Print_ISBN :
978-1-4673-2816-6
Type :
conf
DOI :
10.1109/ICACT.2013.6710490
Filename :
6710490
Link To Document :
بازگشت