• DocumentCode
    3607409
  • Title

    Compression in the Space of Permutations

  • Author

    Da Wang ; Mazumdar, Arya ; Wornell, Gregory W.

  • Author_Institution
    Dept. of Electr. Eng. & Comput. Sci., Massachusetts Inst. of Technol., Cambridge, MA, USA
  • Volume
    61
  • Issue
    12
  • fYear
    2015
  • Firstpage
    6417
  • Lastpage
    6431
  • Abstract
    We investigate lossy compression (source coding) of data in the form of permutations. This problem has direct applications in the storage of ordinal data or rankings, and in the analysis of sorting algorithms. We analyze the rate-distortion characteristic for the permutation space under the uniform distribution, and the minimum achievable rate of compression that allows a bounded distortion after recovery. Our analysis is with respect to different practical and useful distortion measures, including Kendall tau distance, Spearman´s footrule, Chebyshev distance, and inversion-ℓ1 distance. We establish equivalence of source code designs under certain distortions and show simple explicit code designs that incur low encoding/decoding complexities and are asymptotically optimal. Finally, we show that for the Mallows model, a popular nonuniform ranking model on the permutation space, both the entropy and the maximum distortion at zero rate are much lower than the uniform counterparts, which motivates the future design of efficient compression schemes for this model.
  • Keywords
    encoding; sorting; source coding; Chebyshev distance; Kendall tau distance; Spearman footrule; bounded distortion; encoding-decoding complexities; lossy compression; nonuniform ranking model; ordinal data; permutation space; sorting algorithms; source code designs; source coding; space of permutations; uniform distribution; Chebyshev approximation; Distortion; Distortion measurement; Loss measurement; Rate-distortion; Sorting; Tin; Lossy compressions; lossy compressions; mallows model; partial sorting; permutation space;
  • fLanguage
    English
  • Journal_Title
    Information Theory, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9448
  • Type

    jour

  • DOI
    10.1109/TIT.2015.2485270
  • Filename
    7286821