DocumentCode
1293776
Title
Discriminative Training for Multiple Observation Likelihood Ratio Based Voice Activity Detection
Author
Yu, Tao ; Hansen, John H L
Author_Institution
Center for Robust Speech Syst. (CRSS), Univ. of Texas at Dallas, Richardson, TX, USA
Volume
17
Issue
11
fYear
2010
Firstpage
897
Lastpage
900
Abstract
It is possible to show that the likelihood ratio (LR) test from multiple observations can enhance the performance of a statically modeled voice actively detection (VAD) system. However, the combination weights for the likelihood ratios (LRs) in each observation are rather empirical and heuristical. In this study, the optimal combination weights from two discriminative training methods are studied to directly improve VAD performance, in terms of reduced misclassification errors and improved receiver operating characteristics (ROC) curves. As shown in the evaluations, VAD performance, both in terms of absolute performance and consistency across noise types, can be significantly improved using the proposed method.
Keywords
receivers; speech synthesis; VAD performance; discriminative training methods; misclassification errors; multiple observation likelihood ratio; receiver operating characteristics; voice activity detection; Correlation; Fluctuations; Frequency; Hidden Markov models; Noise; Noise generators; Permission; Receivers; Robustness; Signal to noise ratio; Speech; Speech enhancement; Subcontracting; System testing; Training; Training data; Discriminative training; receiver operating characteristics (ROC); voice activity detection (VAD);
fLanguage
English
Journal_Title
Signal Processing Letters, IEEE
Publisher
ieee
ISSN
1070-9908
Type
jour
DOI
10.1109/LSP.2010.2066561
Filename
5546913
Link To Document