Title :
Analysis and improvement of a statistical model-based voice activity detector
Author :
Cho, Yong Duk ; Kondoz, Ahmet
Author_Institution :
Centre for Commun. Syst. Research, Surrey Univ., Guildford, UK
Abstract :
From an investigation of a statistical model-based voice activity detector (VAD), it is found that the likelihood ratio defined in the VAD has a fundamental problem at the offset regions of the speech signals. Thus, we analyze the behavioural mechanism of the likelihood ratio, identify the reason for the unwanted phenomenon, and propose a solution based on a smoothed likelihood ratio. Objective test results show that the proposed method gives a significant improvement to the original VAD. Additionally, the improved VAD results in a performance that is even superior to G.729B and comparable to AMR VAD option 2.
Keywords :
signal detection; speech processing; statistical analysis; AMR VAD option 2; G.729B; behavioural mechanism; likelihood ratio; objective test results; offset regions; performance; smoothed likelihood ratio; speech signals; statistical model-based voice activity detector; Additive noise; Amplitude estimation; Channel capacity; Delay; Detectors; Gaussian noise; Signal to noise ratio; Speech analysis; Speech enhancement; Testing;
Journal_Title :
Signal Processing Letters, IEEE