Title :
Improved voice activity detection based on a smoothed statistical likelihood ratio
Author :
Cho, Yong Duk ; Al-Naimi, K. ; Kondoz, Ahmet
Author_Institution :
Centre for Commun. Syst. Res., Surrey Univ., Guildford, UK
Abstract :
This paper presents the behavioural mechanism of a statistical model-based voice activity detector (VAD), featuring a likelihood ratio test for the activity decision. From investigation of the VAD, it is found that detection errors could occur frequently at speech offset regions because of the delay term in the decision-directed parameter estimator, employed for the estimation of an unknown parameter of the likelihood ratio. Hence, this paper proposes a smoothed likelihood ratio so as to alleviate the detection errors at the offset region. Objective test results show that the proposed scheme is useful for achieving a considerable performance improvement for the VAD. Additionally, the proposed VAD gives detection performances superior to G.729B VAD and comparable with the AMR VAD option 2
Keywords :
noise; parameter estimation; signal detection; speech processing; statistical analysis; AMR VAD option 2; G.729B VAD; behavioural mechanism; decision-directed parameter estimator; delay; detection errors; detection performance; likelihood ratio test; noise estimation; objective test results; smoothed likelihood ratio; speech offset regions; statistical model based voice activity detector; Additive noise; Amplitude estimation; Channel capacity; Delay estimation; Detectors; Gaussian noise; Mobile communication; Signal to noise ratio; Speech enhancement; System testing;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location :
Salt Lake City, UT
Print_ISBN :
0-7803-7041-4
DOI :
10.1109/ICASSP.2001.941020