• DocumentCode
    2328318
  • Title

    Accuracy of joint entropy and mutual information estimates

  • Author

    Bazsó, F. ; Zalányi, L. ; Petróczi, A.

  • Author_Institution
    KFKI Res. Inst. for Particle & Nucl. Phys., Hungarian Acad. of Sci., Budapest, Hungary
  • Volume
    4
  • fYear
    2004
  • fDate
    25-29 July 2004
  • Firstpage
    2843
  • Abstract
    In practice, researchers often face the problem of being able to collect only one, possibly large, dataset, and they are forced to make inferences from a single sample. Based on the results of the polarisation operator technique of Bowman et al (1969), we computed the dependence of joint entropy and mutual information estimates on the sample size in terms of asymptotic series. These expressions enabled us to control the bias of the estimates caused by finite sample sizes and obtain an expression for the accuracies. The result is important in data mining when joint entropy and mutual information are used to find interdependences within large data sets with unknown underlying structures.
  • Keywords
    covariance analysis; data mining; data structures; entropy; asymptotic series; data mining; finite sample sizes; joint entropy; mutual information estimates; Data mining; Entropy; Frequency estimation; Image coding; Mutual information; Nuclear physics; Polarization; Probability; Psychology; Size control;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks, 2004. Proceedings. 2004 IEEE International Joint Conference on
  • ISSN
    1098-7576
  • Print_ISBN
    0-7803-8359-1
  • Type

    conf

  • DOI
    10.1109/IJCNN.2004.1381108
  • Filename
    1381108