DocumentCode
698574
Title
Impact of sample sizes on information theoretic measures for audio-visual signal processing
Author
Arsic, Ivana ; Marina, Ninoslav ; Thiran, Jean-Philippe
Author_Institution
Signal Process. Inst., Ecole Polytech. Fed. de Lausanne (EPFL), Lausanne, Switzerland
fYear
2005
fDate
4-8 Sept. 2005
Firstpage
1
Lastpage
4
Abstract
In this paper we explore the most appropriate number of data samples for measuring the temporal correspondence between a chosen set of video and audio cues in a given audio-visual sequence. At present, no optimal model connecting the statistics of audio and video signals exists, since the most appropriate features to extract for analyzing their correlation are not known. Previous approaches assumed simple parametric and non-parametric models of the joint distribution to capture the complex signal relationships. The main problem in using standard information-theoretic quantities, such as entropy and mutual information, is accurately estimating the probability density function from a limited number of data samples. The main idea is to project the data into a statistically sufficient low-dimensional subspace suitable for density estimation. Then, using a simple parametric model based on an assumption of Gaussianity, mutual information is estimated and applied as a measure of correspondence. We explore how the choice of sample size affects the reliability of the correspondence measure (mutual information) between selected features of the two modalities, audio and video.
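As a rough illustration of the Gaussian mutual-information estimate the abstract describes, the sketch below assumes each modality has already been reduced to a single scalar feature per frame. Under a joint Gaussian assumption, the mutual information then reduces to I = -0.5 ln(1 - r^2), where r is the sample correlation coefficient. The function names `gaussian_mi` and `mean_mi_independent` and the synthetic independent test data are hypothetical and are not taken from the paper; the paper's actual features, projection, and estimator may differ.

```python
import math
import random


def gaussian_mi(x, y):
    """Mutual information in nats between two scalar features,
    assuming they are jointly Gaussian: I = -0.5 * ln(1 - r^2)."""
    n = len(x)
    mx = sum(x) / n
    my = sum(y) / n
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    if sxx == 0.0 or syy == 0.0:
        raise ValueError("a feature with zero variance has no defined correlation")
    r2 = (sxy * sxy) / (sxx * syy)
    # Clamp so perfectly correlated samples do not produce log(0).
    r2 = min(r2, 1.0 - 1e-12)
    return -0.5 * math.log(1.0 - r2)


def mean_mi_independent(n_samples, n_trials=200, seed=0):
    """Average the estimate over trials of truly independent Gaussian data
    (true mutual information is zero), to expose the small-sample bias."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_trials):
        x = [rng.gauss(0.0, 1.0) for _ in range(n_samples)]
        y = [rng.gauss(0.0, 1.0) for _ in range(n_samples)]
        total += gaussian_mi(x, y)
    return total / n_trials


if __name__ == "__main__":
    for n in (10, 100, 1000):
        print(n, mean_mi_independent(n))
```

Because the synthetic features are independent, any nonzero average returned by `mean_mi_independent` is pure estimation error, so comparing it across sample sizes gives a simple picture of how the reliability of the measure depends on the number of samples.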
Keywords
audio signal processing; feature extraction; information theory; probability; video signal processing; Gaussianity; audio signals; audio-visual sequence; audio-visual signal processing; complex signal relationships; density estimation; feature extraction; information theoretic measures; joint distribution; probability density function; video signals; Correlation; Feature extraction; Joints; Mutual information; Size measurement; Speech; Visualization;
fLanguage
English
Publisher
IEEE
Conference_Titel
Signal Processing Conference, 2005 13th European
Conference_Location
Antalya
Print_ISBN
978-160-4238-21-1
Type
conf
Filename
7078162