مرکز منطقه ای اطلاع رساني علوم و فناوري - Grouping algorithm for lossless data compression

DocumentCode :

3057879

Title :

Grouping algorithm for lossless data compression

Author :

Tadayon, Nasser ; Feng, Gui-Liang ; Rao, T.R.N. ; Hinds, E.

Author_Institution :

Center for Adv. Comput. Studies, Southwestern Louisiana Univ., Lafayette, LA, USA

fYear :

1998

fDate :

30 Mar-1 Apr 1998

Firstpage :

574

Abstract :

Summary form only given. There are in fact two main parts in this paper. One is a modification to context-tree weighting algorithm known as CTW, and the other is a new algorithm called grouping. In the CTW method, we consider a binary tree as a context-tree T_D, where each node s in this tree has length l(s) with 0⩾l(s)⩾D for a source generating a sequence of binary digits. There are counts a_s, and b_s, for each node s of T_D denoting the number of zeros and ones respectively. Each internal node s of the tree has two children 0s and 1s. The root of the tree corresponds to memoryless model λ and each node corresponds to a prefix sequence. The second part of the paper, introduces a new algorithm that considers all different binary trees of length ⩾D as complete sets of alphabets for a binary source. By using the KT estimator for each of these alphabet models, we find the probability distribution gained by all different complete extended alphabets. By grouping these models, we define the coding distribution as the average of the probabilities for all the models. We have demonstrated a quick algorithm for this idea and call this approach a grouping algorithm. This approach also breaks the extended alphabet model probability distribution into the non-extended one. Note that the result of this algorithm will produce at most log M(D) more code words than the optimal selection of strings of length at most D, as letters of the alphabet

Keywords :

data compression; encoding; probability; trees (mathematics); CTW; KT estimator; binary digits; binary source; binary trees; children; coding distribution; context-tree weighting algorithm; extended alphabet model; grouping algorithm; internal node; lossless data compression; memoryless model; nonextended model; ones; prefix sequence; probability distribution; source; string length; zeros; Algorithm design and analysis; Application software; Arithmetic; Binary sequences; Context modeling; Data compression;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Data Compression Conference, 1998. DCC '98. Proceedings

Conference_Location :

Snowbird, UT

ISSN :

1068-0314

Print_ISBN :

0-8186-8406-2

Type :

conf

DOI :

10.1109/DCC.1998.672316

Filename :

672316

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3057879