• DocumentCode
    1271684
  • Title

    Likelihood ratio sign test for voice activity detection

  • Author

    Deng, Shaozhi ; Han, Jinguang

  • Author_Institution
    Sch. of Comput. Sci. & Technol., Harbin Inst. of Technol., Harbin, China
  • Volume
    6
  • Issue
    4
  • fYear
    2012
  • fDate
    6/1/2012 12:00:00 AM
  • Firstpage
    306
  • Lastpage
    312
  • Abstract
    Voice activity detection (VAD) plays an important role on the performance of speech processing systems in adverse environments. Recently, statistical model-based VADs have demonstrated impressive performance. The study presents a novel decision test (named likelihood ratio sign test, LRST) for VAD by using sign test and Neyman-Pearson criterion to improve the performance of statistical model-based VAD. The proposed LRST is derived based on the likelihood ratios (LRs) calculated from multiple independent observations by incorporating the long-term speech information into the decision rule. An implementation of the LRST VAD is introduced by defining the LRST over a sliding window and calculating the LRs based on complex Gaussian distribution for an input signal. For experiments, the multiple-observation LRT (MO-LRT) VAD based on multiple observations is used as a reference owing to its outstanding performance compared with conventional VADs. The experimental results show that the proposed approach outperforms the MO-LRT VAD in various noise environments.
  • Keywords
    Gaussian distribution; maximum likelihood estimation; speech processing; statistical analysis; LR; LRST VAD; Neyman-Pearson criterion; complex Gaussian distribution; decision rule; decision test; input signal; likelihood ratio sign test; long-term speech information; multiple independent observations; multiple observations-based MO-LRT VAD; multiple-observation LRT VAD; sliding window; speech processing systems; statistical model-based VAD; voice activity detection;
  • fLanguage
    English
  • Journal_Title
    Signal Processing, IET
  • Publisher
    iet
  • ISSN
    1751-9675
  • Type

    jour

  • DOI
    10.1049/iet-spr.2011.0109
  • Filename
    6280857