• DocumentCode
    35122
  • Title

    On Match Lengths, Zero Entropy, and Large Deviations—With Application to Sliding Window Lempel–Ziv Algorithm

  • Author

    Jain, Sonal ; Bansal, Rakesh K.

  • Author_Institution
    Dept. of Electr. Eng., IIT Kanpur, Kanpur, India
  • Volume
    61
  • Issue
    1
  • fYear
    2015
  • fDate
    Jan. 2015
  • Firstpage
    120
  • Lastpage
    132
  • Abstract
    The sliding window Lempel-Ziv (SWLZ) algorithm that makes use of recurrence times and match lengths has been studied from various perspectives in information theory literature. In this paper, we undertake a finer study of these quantities under two different scenarios: 1) zero entropy sources that are characterized by strong long-term memory and 2) the processes with weak memory as described through various mixing conditions. For zero entropy sources, a general statement on match length is obtained. It is used in the proof of almost sure optimality of fixed shift variant of Lempel-Ziv (FSLZ) and SWLZ algorithms given in literature. Through an example of stationary and ergodic processes generated by an irrational rotation, we establish that for a window of size nw, a compression ratio given by O(log nw/nwa), where a depends on nw and approaches 1 as nw → ∞, is obtained under the application of FSLZ and SWLZ algorithms. In addition, we give a general expression for the compression ratio for a class of stationary and ergodic processes with zero entropy. Next, we extend the study of Ornstein and Weiss on the asymptotic behavior of the normalized version of recurrence times and establish the large deviation property for a class of mixing processes. In addition, an estimator of entropy based on recurrence times is proposed for which large deviation principle is proved for sources satisfying similar mixing conditions.
  • Keywords
    computational complexity; convergence; entropy; compression ratio; ergodic process; exponential convergence; fixed shift variant-of-Lempel-Ziv algorithm; information theory; large deviation property; match lengths; mixing conditions; recurrence times; sliding window Lempel-Ziv algorithm; stationary process; zero entropy sources; Algorithm design and analysis; Convergence; Databases; Entropy; Jacobian matrices; Redundancy; Upper bound; Irrational rotation; entropy dimension; entropy estimator; exponential convergence; large deviation property; match lengths; mixing conditions; recurrence times; sliding window Lempel-Ziv; zero entropy;
  • fLanguage
    English
  • Journal_Title
    Information Theory, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9448
  • Type

    jour

  • DOI
    10.1109/TIT.2014.2368986
  • Filename
    6951479