Title :
Speech extraction based on ICA and audio-visual coherence
Author :
Sodoyer, David ; Girin, Laurent ; Jutten, Christian ; Schwartz, Jean-Luc
Author_Institution :
Speech Commun. Inst., Stendhal Univ., Grenoble, France
Abstract :
We present a new approach to the source separation problem for multiple speech signals. Using the extra visual information of the speaker's face, the method aims to extract an acoustic speech signal from other acoustic signals by exploiting its coherence with the speaker's lip movements. We define a statistical model of the joint probability of visual and spectral audio input for quantifying the audio-visual coherence. Then, separation can be achieved by maximising this joint probability. Experiments on additive mixtures of 2, 3 and 5 sources show that the algorithm performs well, and systematically better than the classical BSS algorithm JADE.
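The idea of extracting a source by maximising an audio-visual joint probability can be illustrated with a toy sketch. Everything below is an assumption made for illustration and is not taken from the paper: the synthetic "speech" (gain-modulated noise), the single visual parameter `v`, the log frame energy used as the audio feature, the single joint Gaussian used as the coherence model, the 2x2 mixing matrix `A`, and the grid search over one extraction angle.

```python
import numpy as np

FRAME = 256  # samples per analysis frame (assumed value)

def make_source(rng, n_frames):
    """Toy 'speech': white noise scaled by a random per-frame log-gain g."""
    g = rng.normal(size=n_frames)
    s = rng.normal(size=(n_frames, FRAME)) * np.exp(g)[:, None]
    return s.ravel(), g

def audio_feature(y):
    """Mean-removed log frame energy, so the feature ignores overall gain."""
    e = np.log(np.mean(y.reshape(-1, FRAME) ** 2, axis=1))
    return e - e.mean()

def fit_gaussian(z):
    """Fit the (hypothetical) joint Gaussian model of [visual, audio] frames."""
    return z.mean(axis=0), np.cov(z, rowvar=False)

def log_likelihood(z, mu, cov):
    """Sum over frames of the Gaussian log-density of z under (mu, cov)."""
    d = z - mu
    maha = np.einsum("ti,ij,tj->t", d, np.linalg.inv(cov), d)
    _, logdet = np.linalg.slogdet(2 * np.pi * cov)
    return float(np.sum(-0.5 * (maha + logdet)))

def extract(x, v, mu, cov, n_angles=360):
    """Grid-search an extraction vector [cos t, sin t] that maximises the
    joint audio-visual log-likelihood of the output y = b^T x."""
    best_ll, best_y = -np.inf, None
    for theta in np.linspace(0, np.pi, n_angles, endpoint=False):
        y = np.cos(theta) * x[0] + np.sin(theta) * x[1]
        ll = log_likelihood(np.column_stack([v, audio_feature(y)]), mu, cov)
        if ll > best_ll:
            best_ll, best_y = ll, y
    return best_y

rng = np.random.default_rng(0)

# Train the coherence model on clean target audio plus its visual parameter.
s_train, g_train = make_source(rng, 400)
v_train = g_train + 0.1 * rng.normal(size=400)  # noisy "lip" parameter
mu, cov = fit_gaussian(np.column_stack([v_train, audio_feature(s_train)]))

# Two-source additive mixture; only the target's visual stream is observed.
s1, g1 = make_source(rng, 200)
s2, _ = make_source(rng, 200)
v = g1 + 0.1 * rng.normal(size=200)
A = np.array([[1.0, 0.8], [0.6, 1.0]])  # assumed mixing matrix
x = A @ np.vstack([s1, s2])

y = extract(x, v, mu, cov)
corr = abs(np.corrcoef(y, s1)[0, 1])  # similarity to the true target, up to sign
print(f"|corr(extracted, target)| = {corr:.3f}")
```

In this sketch the visual stream selects which source is extracted, so only one output is sought rather than a full unmixing matrix. A more realistic model would use spectral audio features and several lip-shape parameters, as the abstract indicates.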
Keywords :
acoustic signal detection; audio signal processing; audio-visual systems; blind source separation; speech processing; acoustic speech signal; audio-visual coherence; blind source separation algorithm; lip movements; multiple speech signals; source separation problem; speaker's face; visual information; Additive noise; Coherence; Data mining; Filter bank; Finite impulse response filter; Independent component analysis; Laboratories; Probability; Source separation; Speech enhancement
Conference_Title :
Signal Processing and Its Applications, 2003. Proceedings. Seventh International Symposium on
Print_ISBN :
0-7803-7946-2
DOI :
10.1109/ISSPA.2003.1224816