DocumentCode :
2159500
Title :
An efficient compressor for biological sequences
Author :
Gupta, Arpan ; Dubey, K.K.
Author_Institution :
Dept. of CS & IT, MJP Rohilkhand Univ., Bareilly, India
fYear :
2013
fDate :
22-23 Feb. 2013
Firstpage :
690
Lastpage :
695
Abstract :
This paper introduces a state of art compressor for DNA sequences that makes use of a replacement method. The replacement method introduces words and a word based compression scheme is used for encoding. The encoder uses frequency distribution for assigning the code of words. The designed statistical compression algorithm is efficient and effective for DNA sequence compression. Experiments show that our algorithm is shown to outperform existing compressors on typical DNA sequence datasets.
Keywords :
DNA; bioinformatics; data compression; encoding; statistical analysis; DNA sequence compression; DNA sequence datasets; biological sequence compressor; encoding; frequency distribution; replacement method; state of art compressor; statistical compression algorithm; word based compression scheme; Biological information theory; Compression algorithms; Context; DNA; Dictionaries; Encoding; Vocabulary; DNA compression; DNA sequences; Word based tagged code;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advance Computing Conference (IACC), 2013 IEEE 3rd International
Conference_Location :
Ghaziabad
Print_ISBN :
978-1-4673-4527-9
Type :
conf
DOI :
10.1109/IAdCC.2013.6514310
Filename :
6514310
Link To Document :
بازگشت