• DocumentCode
    835681
  • Title

    Statistical voice activity detection using low-variance spectrum estimation and an adaptive threshold

  • Author

    Davis, Alan ; Nordholm, Sven ; Togneri, Roberto

  • Volume
    14
  • Issue
    2
  • fYear
    2006
  • fDate
    3/1/2006 12:00:00 AM
  • Firstpage
    412
  • Lastpage
    424
  • Abstract
    Traditionally, voice activity detection algorithms are based on any combination of general speech properties such as temporal energy variations, periodicity, and spectrum. This paper describes a novel statistical method for voice activity detection using a signal-to-noise ratio measure. The method employs a low-variance spectrum estimate and determines an optimal threshold based on the estimated noise statistics. A possible implementation is presented and evaluated over a large test set and compared to current modern standardized algorithms. The evaluations indicate promising results with the proposed scheme being comparable or favorable over the whole test set.
  • Keywords
    parameter estimation; speech processing; statistical analysis; adaptive threshold; low-variance spectrum estimation; noise statistics estimation; signal-to-noise ratio; statistical method; statistical voice activity detection; Australia; Detection algorithms; IEEE activities; Internet telephony; Signal to noise ratio; Spectral analysis; Speech enhancement; Statistical analysis; Statistics; Testing; Adaptive voice activity detection; statistical decision; voice activity detection (VAD); voice activity detector;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TSA.2005.855842
  • Filename
    1597247