Title :
Random-access compression of annotated DNA sequences
Author :
Korodi, Gergely ; Tabus, Ioan
Author_Institution :
Inst. of Signal Process., Tampere Univ. of Technol., Tampere
Abstract :
This article investigates the efficiency of randomly accessible coding for annotated genome files and compares it to universal coding. The result is an encoder which has excellent compression efficiency on annotated genome sequences, provides instantaneous access to functional elements in the file, and thus it serves as a basis for further applications, such as indexing and searching for specified feature entries.
Keywords :
DNA; biology computing; data compression; encoding; file organisation; random processes; sequences; encoder; functional element; random-access DNA sequence compression; Bioinformatics; Biological information theory; Biomedical signal processing; DNA; Genomics; Indexing; Information retrieval; Probability distribution; Sequences; Training data;
Conference_Titel :
Genomic Signal Processing and Statistics, 2006. GENSIPS '06. IEEE International Workshop on
Conference_Location :
College Station, TX
Print_ISBN :
1-4244-0384-7
Electronic_ISBN :
1-4244-0385-5
DOI :
10.1109/GENSIPS.2006.353160