• DocumentCode
    698574
  • Title

    Impact of sample sizes on information theoretic measures for audio-visual signal processing

  • Author

    Arsic, Ivana ; Marina, Ninoslav ; Thiran, Jean-Philippe

  • Author_Institution
    Signal Process. Inst., Ecole Polytech. Fed. de Lausanne (EPFL), Lausanne, Switzerland
  • fYear
    2005
  • fDate
    4-8 Sept. 2005
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    In this paper we aim to explore what is the most appropriate number of data samples needed when measuring the temporal correspondence between a chosen set of video and audio cues in a given audio-visual sequence. Presently the optimal model that connects statistics of audio and video signals does not exist since one does not know the most appropriate features to be extracted in order to analyze their correlation. Previous approaches assumed simple parametric and non-parametric models for the joint distribution for capturing the complex signal relationships. The main problem in using standard information theoretic quantities, such as entropy and mutual information, is to accurately estimate the probability density function from a limited number of data samples. The main idea is to project the data into a statistically sufficient low-dimensional subspace, suitable for density estimation. Then using a simple parametric model based on assumption of Gaussianity, mutual information is estimated and applied as a measure of correspondence. We exploit how the choice of the sample size affects the reliability of the correspondence measure (mutual information) between selected features of the two modalities, audio and video.
  • Keywords
    audio signal processing; feature extraction; information theory; probability; video signal processing; Gaussianity; audio signals; audio-visual sequence; audio-visual signal processing; complex signal relationships; density estimation; feature extraction; information theoretic measures; joint distribution; probability density function; video signals; Correlation; Feature extraction; Joints; Mutual information; Size measurement; Speech; Visualization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference, 2005 13th European
  • Conference_Location
    Antalya
  • Print_ISBN
    978-160-4238-21-1
  • Type

    conf

  • Filename
    7078162