DocumentCode :
1293776
Title :
Discriminative Training for Multiple Observation Likelihood Ratio Based Voice Activity Detection
Author :
Yu, Tao ; Hansen, John H L
Author_Institution :
Center for Robust Speech Syst. (CRSS), Univ. of Texas at Dallas, Richardson, TX, USA
Volume :
17
Issue :
11
fYear :
2010
Firstpage :
897
Lastpage :
900
Abstract :
It is possible to show that the likelihood ratio (LR) test from multiple observations can enhance the performance of a statically modeled voice actively detection (VAD) system. However, the combination weights for the likelihood ratios (LRs) in each observation are rather empirical and heuristical. In this study, the optimal combination weights from two discriminative training methods are studied to directly improve VAD performance, in terms of reduced misclassification errors and improved receiver operating characteristics (ROC) curves. As shown in the evaluations, VAD performance, both in terms of absolute performance and consistency across noise types, can be significantly improved using the proposed method.
Keywords :
receivers; speech synthesis; VAD performance; discriminative training methods; misclassification errors; multiple observation likelihood ratio; receiver operating characteristics; voice activity detection; Correlation; Fluctuations; Frequency; Hidden Markov models; Noise; Noise generators; Permission; Receivers; Robustness; Signal to noise ratio; Speech; Speech enhancement; Subcontracting; System testing; Training; Training data; Discriminative training; receiver operating characteristics (ROC); voice activity detection (VAD);
fLanguage :
English
Journal_Title :
Signal Processing Letters, IEEE
Publisher :
ieee
ISSN :
1070-9908
Type :
jour
DOI :
10.1109/LSP.2010.2066561
Filename :
5546913
Link To Document :
بازگشت