DocumentCode :
1541351
Title :
Analysis and improvement of a statistical model-based voice activity detector
Author :
Cho, Yong Duk ; Kondoz, Ahmet
Author_Institution :
Centre for Commun. Syst. Research, Surrey Univ., Guildford, UK
Volume :
8
Issue :
10
fYear :
2001
Firstpage :
276
Lastpage :
278
Abstract :
From an investigation of a statistical model-based voice activity detector (VAD), it is found that the likelihood ratio defined in the VAD has a fundamental problem at the offset regions of the speech signals. Thus, we analyze the behavioural mechanism of the likelihood ratio, identify the reason for the unwanted phenomenon, and propose a solution based on a smoothed likelihood ratio. Objective test results show that the proposed method gives a significant improvement to the original VAD. Additionally, the improved VAD results in a performance that is even superior to G.729B and comparable to AMR VAD option 2.
Keywords :
signal detection; speech processing; statistical analysis; AMR VAD option 2; G.729B; behavioural mechanism; likelihood ratio; objective test results; offset regions; performance; smoothed likelihood ratio; speech signals; statistical model-based voice activity detector; Additive noise; Amplitude estimation; Channel capacity; Delay; Detectors; Gaussian noise; Signal to noise ratio; Speech analysis; Speech enhancement; Testing;
fLanguage :
English
Journal_Title :
Signal Processing Letters, IEEE
Publisher :
ieee
ISSN :
1070-9908
Type :
jour
DOI :
10.1109/97.957270
Filename :
957270
Link To Document :
بازگشت