DocumentCode :
417218
Title :
A differential spectral voice activity detector
Author :
Garner, Philip N. ; Fukada, Toshiaki ; Komori, Yasuhiro
Author_Institution :
Canon Inc, Tokyo, Japan
Volume :
1
fYear :
2004
fDate :
17-21 May 2004
Abstract :
The voice activity detection (VAD) problem is placed into a decision theoretic framework, and the Gaussian VAD model of Sohn et al. (1998, 1999) is then shown to fit well with the framework. It is argued that the Gaussian model can be made more robust to correlation and expected spectral shapes of speech and noise by using a differential spectral representation. Such a model is formulated theoretically. The differential spectral VAD is then shown by experiment to compare favourably with the basic Gaussian VAD in a speech recognition setting, especially for noisy environments.
Keywords :
Gaussian distribution; decision theory; signal representation; spectral analysis; speech recognition; Gaussian VAD model; correlation robustness; differential spectral representation; spectral shapes; speech recognition; voice activity detector; Automatic speech recognition; Cost function; Detectors; Gaussian noise; Noise robustness; Noise shaping; Spectral shape; Speech enhancement; Speech recognition; Working environment noise;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-8484-9
Type :
conf
DOI :
10.1109/ICASSP.2004.1326056
Filename :
1326056
Link To Document :
بازگشت