• DocumentCode
    1135519
  • Title

    An Improved Voice Activity Detection Using Higher Order Statistics

  • Author

    Li, Ke ; Swamy, M.N.S. ; Ahmad, M. Omair

  • Author_Institution
    Siemens, Beijing, China
  • Volume
    13
  • Issue
    5
  • fYear
    2005
  • Firstpage
    965
  • Lastpage
    974
  • Abstract
    In this paper, by using the properties of the higher order statistics (HOS) of speech and noise signals, we develop an improved voice activity detection (VAD) scheme. The proposed scheme employs the logarithm of the kurtosis of the LPC residual of a speech signal and is shown to be more effective and efficient in detecting active speech in medium to low signal-to-noise ratio (SNR) conditions without being unduly affected by the variations in the signal energy. To overcome the inability of the HOS in detecting unvoiced speech, another metric (the low band to full band energy ratio) is introduced. Depending on the estimated mean SNR, the proposed scheme works adaptively in two modes: a simple mode using only the SNR, and an enhanced mode using the HOS, the low band to full band energy ratio and the SNR. This scheme is capable of avoiding unnecessary computations, while maintaining the same performance as that working only in the enhanced mode. Simulations results are presented to demonstrate the effectiveness of the proposed voice activity detection scheme.
  • Keywords
    higher order statistics; speech processing; full band energy ratio; higher order statistics; improved voice activity detection; noise signals; signal-to-noise ratio; speech signals; Active noise reduction; Computational modeling; Decorrelation; Detectors; Fuzzy sets; Higher order statistics; Linear predictive coding; Signal to noise ratio; Speech enhancement; Working environment noise; Higher order statistics; low band to full band energy ratio; voice activity detection;
  • fLanguage
    English
  • Journal_Title
    Speech and Audio Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1063-6676
  • Type

    jour

  • DOI
    10.1109/TSA.2005.851955
  • Filename
    1495478