DocumentCode :
147107
Title :
Compression of Quality Factors in Next Generation Sequencing
Author :
Nalbantoglu, O.U. ; Sayood, K.
Author_Institution :
Dept. of Electr. Eng., Univ. of Nebraska, Lincoln, NE, USA
fYear :
2014
fDate :
26-28 March 2014
Firstpage :
419
Lastpage :
419
Abstract :
We propose a compression algorithm for the quality scores contained in FASTQ files which are generated in large volumes during high throughput sequencing. The proposed algorithm is a context dependent arithmetic coder which is based on observations of the structure of quality scores in FASTQ files. Simulation results indicate a significantly superior performance of the algorithm to the current state of the art.
Keywords :
Q-factor; arithmetic codes; data compression; FASTQ files; compression algorithm; context dependent arithmetic coder; high throughput sequencing; next generation sequencing; quality factors compression; quality scores; Context; Data compression; Educational institutions; Electrical engineering; Next generation networking; Q-factor; Sequential analysis; Biological sequence compression; DNA; Quality factor;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Compression Conference (DCC), 2014
Conference_Location :
Snowbird, UT
ISSN :
1068-0314
Type :
conf
DOI :
10.1109/DCC.2014.46
Filename :
6824471
Link To Document :
بازگشت