• DocumentCode
    3127351
  • Title

    On lossless universal compression of distributed identical sources

  • Author

    Beirami, Ahmad ; Fekri, Faramarz

  • Author_Institution
    Sch. of Electr. & Comput. Eng., Georgia Inst. of Technol., Atlanta, GA, USA
  • fYear
    2012
  • fDate
    1-6 July 2012
  • Firstpage
    561
  • Lastpage
    565
  • Abstract
    Slepian-Wolf theorem is a well-known framework that targets almost lossless compression of (two) data streams with symbol-by-symbol correlation between the outputs of (two) distributed sources. However, this paper considers a different scenario which does not fit in the Slepian-Wolf framework. We consider two identical but spatially separated sources. We wish to study the universal compression of a sequence of length n from one of the sources provided that the decoder has access to (i.e., memorized) a sequence of length m from the other source. Such a scenario occurs, for example, in the universal compression of data from multiple mirrors of the same server. In this setup, the correlation does not arise from symbol-by-symbol dependency of two outputs from the two sources. Instead, the sequences are correlated through the information that they contain about the unknown source parameter. We show that the finite-length nature of the compression problem at hand requires considering a notion of almost lossless source coding, where coding incurs an error probability pe(n) that vanishes with sequence length n. We obtain a lower bound on the average minimax redundancy of almost lossless codes as a function of the sequence length n and the permissible error probability pe when the decoder has a memory of length m and the encoders do not communicate. Our results demonstrate that a strict performance loss is incurred when the two encoders do not communicate even when the decoder knows the unknown parameter vector (i.e., m → ∞).
  • Keywords
    decoding; minimax techniques; mirrors; probability; sequences; source coding; vectors; Slepian-Wolf theorem; average minimax redundancy; data stream; decoder; distributed identical source; distributed source; encoder; error probability; finite-length nature; length sequence; lossless source coding; lossless universal compression; multiple mirror; symbol-by-symbol correlation; unknown source parameter; vector; Channel coding; Decoding; Error probability; Redundancy; Source coding; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Theory Proceedings (ISIT), 2012 IEEE International Symposium on
  • Conference_Location
    Cambridge, MA
  • ISSN
    2157-8095
  • Print_ISBN
    978-1-4673-2580-6
  • Electronic_ISBN
    2157-8095
  • Type

    conf

  • DOI
    10.1109/ISIT.2012.6284253
  • Filename
    6284253