DocumentCode :
3057879
Title :
Grouping algorithm for lossless data compression
Author :
Tadayon, Nasser ; Feng, Gui-Liang ; Rao, T.R.N. ; Hinds, E.
Author_Institution :
Center for Adv. Comput. Studies, Southwestern Louisiana Univ., Lafayette, LA, USA
fYear :
1998
fDate :
30 Mar-1 Apr 1998
Firstpage :
574
Abstract :
Summary form only given. There are in fact two main parts in this paper. One is a modification to context-tree weighting algorithm known as CTW, and the other is a new algorithm called grouping. In the CTW method, we consider a binary tree as a context-tree TD, where each node s in this tree has length l(s) with 0⩾l(s)⩾D for a source generating a sequence of binary digits. There are counts as , and bs, for each node s of TD denoting the number of zeros and ones respectively. Each internal node s of the tree has two children 0s and 1s. The root of the tree corresponds to memoryless model λ and each node corresponds to a prefix sequence. The second part of the paper, introduces a new algorithm that considers all different binary trees of length ⩾D as complete sets of alphabets for a binary source. By using the KT estimator for each of these alphabet models, we find the probability distribution gained by all different complete extended alphabets. By grouping these models, we define the coding distribution as the average of the probabilities for all the models. We have demonstrated a quick algorithm for this idea and call this approach a grouping algorithm. This approach also breaks the extended alphabet model probability distribution into the non-extended one. Note that the result of this algorithm will produce at most log M(D) more code words than the optimal selection of strings of length at most D, as letters of the alphabet
Keywords :
data compression; encoding; probability; trees (mathematics); CTW; KT estimator; binary digits; binary source; binary trees; children; coding distribution; context-tree weighting algorithm; extended alphabet model; grouping algorithm; internal node; lossless data compression; memoryless model; nonextended model; ones; prefix sequence; probability distribution; source; string length; zeros; Algorithm design and analysis; Application software; Arithmetic; Binary sequences; Context modeling; Data compression;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Compression Conference, 1998. DCC '98. Proceedings
Conference_Location :
Snowbird, UT
ISSN :
1068-0314
Print_ISBN :
0-8186-8406-2
Type :
conf
DOI :
10.1109/DCC.1998.672316
Filename :
672316
Link To Document :
بازگشت