• DocumentCode
    1293776
  • Title

    Discriminative Training for Multiple Observation Likelihood Ratio Based Voice Activity Detection

  • Author

    Yu, Tao ; Hansen, John H L

  • Author_Institution
    Center for Robust Speech Syst. (CRSS), Univ. of Texas at Dallas, Richardson, TX, USA
  • Volume
    17
  • Issue
    11
  • fYear
    2010
  • Firstpage
    897
  • Lastpage
    900
  • Abstract
    It is possible to show that the likelihood ratio (LR) test from multiple observations can enhance the performance of a statically modeled voice actively detection (VAD) system. However, the combination weights for the likelihood ratios (LRs) in each observation are rather empirical and heuristical. In this study, the optimal combination weights from two discriminative training methods are studied to directly improve VAD performance, in terms of reduced misclassification errors and improved receiver operating characteristics (ROC) curves. As shown in the evaluations, VAD performance, both in terms of absolute performance and consistency across noise types, can be significantly improved using the proposed method.
  • Keywords
    receivers; speech synthesis; VAD performance; discriminative training methods; misclassification errors; multiple observation likelihood ratio; receiver operating characteristics; voice activity detection; Correlation; Fluctuations; Frequency; Hidden Markov models; Noise; Noise generators; Permission; Receivers; Robustness; Signal to noise ratio; Speech; Speech enhancement; Subcontracting; System testing; Training; Training data; Discriminative training; receiver operating characteristics (ROC); voice activity detection (VAD);
  • fLanguage
    English
  • Journal_Title
    Signal Processing Letters, IEEE
  • Publisher
    ieee
  • ISSN
    1070-9908
  • Type

    jour

  • DOI
    10.1109/LSP.2010.2066561
  • Filename
    5546913