DocumentCode :
120735
Title :
A compression algorithm for DNA that uses ASCII values
Author :
Priyanka ; Goel, Shivani
Author_Institution :
MSIT, IP Univ., New Delhi, India
fYear :
2014
fDate :
21-22 Feb. 2014
Firstpage :
739
Lastpage :
743
Abstract :
The properties of DNA sequences offer an opportunity to develop DNA specific compression algorithm. A lossless two phase compression algorithm is presented for DNA sequences. In the first phase a modified version of Run Length Encoding (RLE) is applied and in the second phase the resultant genetic sequences is compressed using ASCII values. Using ASCII codes for eight bits ensures one-fourth compression irrespective of repeated or non-repeated behavior of the sequence and modified RLE technique enhances the compression further more. Not only the compression ratio of the algorithm is quite encouraging but the simple technique of compression makes it more interesting.
Keywords :
DNA; biology computing; data compression; genetics; runlength codes; ASCII codes; ASCII values; CUDA; DNA sequences; DNA specific compression algorithm; genetic sequences; modified RLE technique; nonrepeated sequence behavior; phase compression algorithm; repeated sequence behavior; run length encoding; Bioinformatics; Compression algorithms; DNA; Encoding; Genomics; Image coding; Indexes; ASCII; Big data; CUDA; DNA; Run length Encoding (RLE); decode; encode;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advance Computing Conference (IACC), 2014 IEEE International
Conference_Location :
Gurgaon
Print_ISBN :
978-1-4799-2571-1
Type :
conf
DOI :
10.1109/IAdCC.2014.6779416
Filename :
6779416
Link To Document :
بازگشت