Title :
Discriminative Training for Multiple Observation Likelihood Ratio Based Voice Activity Detection
Author :
Yu, Tao ; Hansen, John H L
Author_Institution :
Center for Robust Speech Syst. (CRSS), Univ. of Texas at Dallas, Richardson, TX, USA
Abstract :
It is possible to show that the likelihood ratio (LR) test from multiple observations can enhance the performance of a statically modeled voice actively detection (VAD) system. However, the combination weights for the likelihood ratios (LRs) in each observation are rather empirical and heuristical. In this study, the optimal combination weights from two discriminative training methods are studied to directly improve VAD performance, in terms of reduced misclassification errors and improved receiver operating characteristics (ROC) curves. As shown in the evaluations, VAD performance, both in terms of absolute performance and consistency across noise types, can be significantly improved using the proposed method.
Keywords :
receivers; speech synthesis; VAD performance; discriminative training methods; misclassification errors; multiple observation likelihood ratio; receiver operating characteristics; voice activity detection; Correlation; Fluctuations; Frequency; Hidden Markov models; Noise; Noise generators; Permission; Receivers; Robustness; Signal to noise ratio; Speech; Speech enhancement; Subcontracting; System testing; Training; Training data; Discriminative training; receiver operating characteristics (ROC); voice activity detection (VAD);
Journal_Title :
Signal Processing Letters, IEEE
DOI :
10.1109/LSP.2010.2066561