DocumentCode :
2236991
Title :
Multichannel voice detection in adverse environments
Author :
Rosca, J. ; Balan, R. ; Fan, N.P. ; Beaugeant, C. ; Gilg, V.
Author_Institution :
Dept. of Multimedia & Video Technol., Siemens Corp. Res., Princeton, NJ, USA
fYear :
2002
fDate :
3-6 Sept. 2002
Firstpage :
1
Lastpage :
4
Abstract :
Detecting when voice is or is not present is an outstanding problem for speech transmission, enhancement and recognition. Here we present a novel multichannel source activity detector that exploits the spatial localization of the target audio source. The detector uses an array signal processing technique to maximize the signal-to-interference ratio for the target source thus decreasing the activity detection error rate. We compare our two-channel voice activity detector (VAD) with the AMR voice detection algorithms on real data recorded in a noisy car environment. The new algorithm shows improvements in error rates of 55-70% compared to the state-of-the-art adaptive multi-rate algorithm AMR2 used in present voice transmission technology.
Keywords :
acoustic signal detection; array signal processing; error statistics; speech processing; AMR voice detection algorithms; AMR2; VAD; activity detection error rate; adaptive multirate algorithm; array signal processing technique; multichannel source activity detector; signal-to-interference ratio; spatial localization; target audio source; two-channel voice activity detector; Abstracts; Detectors; Filtering algorithms; Maximum likelihood detection; Microwave integrated circuits; Noise; Nonlinear filters;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing Conference, 2002 11th European
Conference_Location :
Toulouse
ISSN :
2219-5491
Type :
conf
Filename :
7072135
Link To Document :
بازگشت