Title :
A differential spectral voice activity detector
Author :
Garner, Philip N. ; Fukada, Toshiaki ; Komori, Yasuhiro
Author_Institution :
Canon Inc, Tokyo, Japan
Abstract :
The voice activity detection (VAD) problem is placed into a decision theoretic framework, and the Gaussian VAD model of Sohn et al. (1998, 1999) is then shown to fit well with the framework. It is argued that the Gaussian model can be made more robust to correlation and expected spectral shapes of speech and noise by using a differential spectral representation. Such a model is formulated theoretically. The differential spectral VAD is then shown by experiment to compare favourably with the basic Gaussian VAD in a speech recognition setting, especially for noisy environments.
Keywords :
Gaussian distribution; decision theory; signal representation; spectral analysis; speech recognition; Gaussian VAD model; correlation robustness; differential spectral representation; spectral shapes; speech recognition; voice activity detector; Automatic speech recognition; Cost function; Detectors; Gaussian noise; Noise robustness; Noise shaping; Spectral shape; Speech enhancement; Speech recognition; Working environment noise;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
Print_ISBN :
0-7803-8484-9
DOI :
10.1109/ICASSP.2004.1326056