• DocumentCode
    134210
  • Title

    Efficient voice activity detection algorithm based on sub-band temporal envelope and sub-band long-term signal variability

  • Author

    Bin Liu ; Jianhua Tao ; Fuyuan Mo ; Ya Li ; Zhengqi Wen ; Shanfeng Liu

  • Author_Institution
    Nat. Lab. of Pattern Recognition, Inst. of Autom., Beijing, China
  • fYear
    2014
  • fDate
    12-14 Sept. 2014
  • Firstpage
    531
  • Lastpage
    535
  • Abstract
    Voice activity detection (VAD) is widely used for various speech-based systems which is an important pre-processing step. This paper proposes a robust voice activity detection algorithm. In the proposed algorithm, the sub-band temporal envelope and the sub-band long-term signal variability are considered to distinguish the speech from all kinds of non-speech which include stationary noise and non-stationary noise. The two features are combined to make a robust VAD decision according to the fusion decision. The proposed algorithm also is an unsupervised low-complexity algorithm and can operate without pre-train models. The experiments results show that the proposed algorithm is prior to the different baseline algorithms and can handle a variety of noise environments over a wide range of signal-to-noise ratios. The proposed algorithm could apply to speech-based systems.
  • Keywords
    speech processing; VAD algorithm; fusion decision; nonstationary noise; robust VAD decision; signal-to-noise ratios; speech-based systems; subband long-term signal variability; subband temporal envelope; unsupervised low-complexity algorithm; voice activity detection algorithm; Entropy; Feature extraction; Noise measurement; Robustness; Signal to noise ratio; Speech; fusion decision; sub-band long-term signal variability; sub-band temporal envelope; voice activity detection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Chinese Spoken Language Processing (ISCSLP), 2014 9th International Symposium on
  • Conference_Location
    Singapore
  • Type

    conf

  • DOI
    10.1109/ISCSLP.2014.6936602
  • Filename
    6936602