• DocumentCode
    181648
  • Title

    An extended similarity distance for use with computable information estimators

  • Author

    Speidel, U.

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Auckland, Auckland, New Zealand
  • fYear
    2014
  • fDate
    26-29 Oct. 2014
  • Firstpage
    304
  • Lastpage
    308
  • Abstract
    Computable complexity and information estimators such as those from the Lempel-Ziv family or Titchener´s T-complexity and T-information may be used in similarity comparison in conjunction with the Normalized Compression Distance (NCD). The NCD is (almost) a metric and computes a similarity distance between two digitally encoded objects x and y based exclusively on their estimated individual and joint information content. In some similarity comparison applications, however, objects may also be distinguished by entropy rate rather than information content only. However, the NCD is not sensitive to entropy rate. This paper proposes an entropy rate sensitive extended version of the NCD, called ENCD, for use in such applications. It also shows that the T-information performs well in the context of both NCD and ENCD. Finally, the paper discusses the problem of added noise and scaling in input data to the NCD and ENCD, and demonstrates how appropriate encoding of the input data may mitigate the impact of these effects.
  • Keywords
    computational complexity; data compression; entropy; ENCD; Lempel-Ziv family; NCD metric; T-complexity; T-information; added noise problem; computable complexity; computable information estimators; digitally encoded objects; entropy rate sensitive extended NCD; extended similarity distance; individual information content; input data encoding; joint information content; normalized compression distance; scaling problem; similarity comparison applications; Complexity theory; Compressors; Entropy; Noise measurement; Signal to noise ratio;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Theory and its Applications (ISITA), 2014 International Symposium on
  • Conference_Location
    Melbourne, VIC
  • Type

    conf

  • Filename
    6979853