DocumentCode
835681
Title
Statistical voice activity detection using low-variance spectrum estimation and an adaptive threshold
Author
Davis, Alan ; Nordholm, Sven ; Togneri, Roberto
Volume
14
Issue
2
fYear
2006
fDate
3/1/2006 12:00:00 AM
Firstpage
412
Lastpage
424
Abstract
Traditionally, voice activity detection algorithms are based on any combination of general speech properties such as temporal energy variations, periodicity, and spectrum. This paper describes a novel statistical method for voice activity detection using a signal-to-noise ratio measure. The method employs a low-variance spectrum estimate and determines an optimal threshold based on the estimated noise statistics. A possible implementation is presented and evaluated over a large test set and compared to current modern standardized algorithms. The evaluations indicate promising results with the proposed scheme being comparable or favorable over the whole test set.
Keywords
parameter estimation; speech processing; statistical analysis; adaptive threshold; low-variance spectrum estimation; noise statistics estimation; signal-to-noise ratio; statistical method; statistical voice activity detection; Australia; Detection algorithms; IEEE activities; Internet telephony; Signal to noise ratio; Spectral analysis; Speech enhancement; Statistical analysis; Statistics; Testing; Adaptive voice activity detection; statistical decision; voice activity detection (VAD); voice activity detector;
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher
ieee
ISSN
1558-7916
Type
jour
DOI
10.1109/TSA.2005.855842
Filename
1597247
Link To Document