DocumentCode :
454624
Title :
An Analysis of Visual Speech Information Applied to Voice Activity Detection
Author :
Sodoyer, David ; Rivet, Bertrand ; Girin, Laurent ; Schwartz, Jean-Luc ; Jutten, Christian
Author_Institution :
Inst. of Speech Commun., CNRS, Grenoble
Volume :
1
fYear :
2006
fDate :
14-19 May 2006
Abstract :
We present a new approach to the voice activity detection (VAD) problem for speech signals embedded in non-stationary noise. The method is based on automatic lipreading: the objective is to detect voice activity or non-activity by exploiting the coherence between the speech acoustic signal and the speaker´s lip movements. From a comprehensive analysis of lip shape parameters during speech and non-speech events, we show that a single appropriate visual parameter, defined to characterize the lip movements, can be used for the detection of sections of voice activity or more precisely, for the detection of silence sections. Detection scores obtained on spontaneous speech confirm the efficiency of the visual voice activity detector (VVAD)
Keywords :
face recognition; gesture recognition; speech recognition; automatic lipreading; lip movements; nonspeech events; nonstationary noise; speech acoustic signal; visual speech information; visual voice activity detector; voice activity detection; Acoustic noise; Acoustic signal detection; Background noise; Detectors; Event detection; Information analysis; Signal processing; Speech analysis; Speech enhancement; Speech processing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location :
Toulouse
ISSN :
1520-6149
Print_ISBN :
1-4244-0469-X
Type :
conf
DOI :
10.1109/ICASSP.2006.1660092
Filename :
1660092
Link To Document :
بازگشت