DocumentCode :
835681
Title :
Statistical voice activity detection using low-variance spectrum estimation and an adaptive threshold
Author :
Davis, Alan ; Nordholm, Sven ; Togneri, Roberto
Volume :
14
Issue :
2
fYear :
2006
fDate :
3/1/2006 12:00:00 AM
Firstpage :
412
Lastpage :
424
Abstract :
Traditionally, voice activity detection algorithms are based on any combination of general speech properties such as temporal energy variations, periodicity, and spectrum. This paper describes a novel statistical method for voice activity detection using a signal-to-noise ratio measure. The method employs a low-variance spectrum estimate and determines an optimal threshold based on the estimated noise statistics. A possible implementation is presented and evaluated over a large test set and compared to current modern standardized algorithms. The evaluations indicate promising results with the proposed scheme being comparable or favorable over the whole test set.
Keywords :
parameter estimation; speech processing; statistical analysis; adaptive threshold; low-variance spectrum estimation; noise statistics estimation; signal-to-noise ratio; statistical method; statistical voice activity detection; Australia; Detection algorithms; IEEE activities; Internet telephony; Signal to noise ratio; Spectral analysis; Speech enhancement; Statistical analysis; Statistics; Testing; Adaptive voice activity detection; statistical decision; voice activity detection (VAD); voice activity detector;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TSA.2005.855842
Filename :
1597247
Link To Document :
بازگشت