Title :
A non-causal approach to voice activity detection in adverse environments using a novel noise estimator
Author :
Esmaeili, A. ; Ahadi, S.M. ; Fassihi, M.A.
Author_Institution :
Electr. Eng. Dept., Amirkabir Univ. of Technol., Tehran, Iran
Abstract :
An analysis of the likelihood ratio (LR)-based VAD shows that the delay associated with the decision-directed (DD) a priori SNR estimator can lead to detection errors at speech onsets and offsets. This paper investigates the properties of a non-causal estimator previously used in a speech enhancement system. It is shown that applying the non-causal estimator improves the robustness of the VAD in noisy environments, particularly at low SNRs. In addition, the associated noise estimation procedure is further improved by applying a dynamic, time-varying smoothing factor. Objective tests based on speech/non-speech discrimination show that the proposed VAD outperforms standard VAD algorithms, including ETSI-VADNest, AMR1, and AMR2, as well as the statistical VADs based on smoothed LR and multiple-observation LR, particularly at low SNRs, at the cost of some delay.
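The onset delay mentioned in the abstract can be illustrated with a minimal sketch. The abstract gives no formulas, so the code below assumes the commonly used textbook forms: the decision-directed recursion for the a priori SNR and the per-bin Gaussian log-likelihood ratio. The function names, the smoothing constant `alpha = 0.98`, the Wiener gain used to form the previous clean-speech estimate, and the toy input are all illustrative assumptions, not the authors' implementation. The non-causal estimator and the time-varying smoothing factor of the paper are not reproduced here.

```python
import math

def dd_a_priori_snr(power_frames, noise_var, alpha=0.98):
    """Decision-directed a priori SNR for one frequency bin (assumed textbook form).

    power_frames: noisy power |X(t)|^2 per frame; noise_var: noise variance,
    assumed known and constant here for simplicity.
    """
    xi_track = []
    prev_clean_power = 0.0  # previous clean-speech power estimate, |A(t-1)|^2
    for p in power_frames:
        gamma = p / noise_var  # a posteriori SNR
        # Weighted sum of the previous-frame estimate and the current ML estimate.
        xi = alpha * prev_clean_power / noise_var + (1.0 - alpha) * max(gamma - 1.0, 0.0)
        gain = xi / (1.0 + xi)  # Wiener gain (an assumption; other gains are possible)
        prev_clean_power = (gain ** 2) * p
        xi_track.append(xi)
    return xi_track

def log_likelihood_ratio(gamma, xi):
    """Per-bin log LR under the Gaussian model (assumed standard form)."""
    return gamma * xi / (1.0 + xi) - math.log(1.0 + xi)

# Toy input: five noise-only frames, then five speech frames whose true
# a priori SNR is 10 (noisy power 11 with unit noise variance).
noise_var = 1.0
power = [1.0] * 5 + [11.0] * 5
xi = dd_a_priori_snr(power, noise_var)
llr = [log_likelihood_ratio(p / noise_var, x) for p, x in zip(power, xi)]
# At the first speech frame the estimate is only about 0.2, far below 10,
# because the recursion leans heavily on the previous (noise-only) frame.
```

With these assumed settings the estimate rises only gradually over the following frames, so the log LR stays low for several frames after the onset. This lag is the kind of error source that an estimator with access to future frames is meant to reduce.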
Keywords :
smoothing methods; speech enhancement; statistical analysis; AMR1; AMR2; ETSI-VADNest; LR-based VAD; a priori SNR estimator; adverse environment; associated noise estimation procedure; dynamic time varying smoothing factor; noncausal estimator; nonspeech discrimination; speech discrimination; speech enhancement system; statistical VAD; voice activity detection; Delays; Signal to noise ratio; Smoothing methods; Speech; Speech enhancement; Voice Activity Detection (VAD); likelihood ratio; non-causal estimation;
Conference_Title :
Signal Processing Conference, 2007 15th European
Conference_Location :
Poznan
Print_ISBN :
978-839-2134-04-6