Title :
Likelihood ratio sign test for voice activity detection
Author :
Deng, Shaozhi ; Han, Jinguang
Author_Institution :
Sch. of Comput. Sci. & Technol., Harbin Inst. of Technol., Harbin, China
fDate :
6/1/2012 12:00:00 AM
Abstract :
Voice activity detection (VAD) plays an important role on the performance of speech processing systems in adverse environments. Recently, statistical model-based VADs have demonstrated impressive performance. The study presents a novel decision test (named likelihood ratio sign test, LRST) for VAD by using sign test and Neyman-Pearson criterion to improve the performance of statistical model-based VAD. The proposed LRST is derived based on the likelihood ratios (LRs) calculated from multiple independent observations by incorporating the long-term speech information into the decision rule. An implementation of the LRST VAD is introduced by defining the LRST over a sliding window and calculating the LRs based on complex Gaussian distribution for an input signal. For experiments, the multiple-observation LRT (MO-LRT) VAD based on multiple observations is used as a reference owing to its outstanding performance compared with conventional VADs. The experimental results show that the proposed approach outperforms the MO-LRT VAD in various noise environments.
Keywords :
Gaussian distribution; maximum likelihood estimation; speech processing; statistical analysis; LR; LRST VAD; Neyman-Pearson criterion; complex Gaussian distribution; decision rule; decision test; input signal; likelihood ratio sign test; long-term speech information; multiple independent observations; multiple observations-based MO-LRT VAD; multiple-observation LRT VAD; sliding window; speech processing systems; statistical model-based VAD; voice activity detection;
Journal_Title :
Signal Processing, IET
DOI :
10.1049/iet-spr.2011.0109