• DocumentCode
    1797326
  • Title

    Hybrid SVM/HMM architectures for statistical model-based voice activity detection

  • Author

    Ying-Wei Tan ; Wen-Ju Liu ; Wei Jiang ; Hao Zheng

  • Author_Institution
    Dept. of Nat. Lab. of Pattern Recognition, Inst. of Autom., Beijing, China
  • fYear
    2014
  • fDate
    6-11 July 2014
  • Firstpage
    2875
  • Lastpage
    2878
  • Abstract
    The decision function of support vector machine (SVM) using the likelihood ratios (LRs) is successfully used for statistical model-based voice activity detection (VAD). It is known to incorporate an optimised nonlinear decision over two different classes, instead of comparing the geometric mean of the LRs for the individual frequency bands with a given threshold for speech detection. However, the inter-frame correlation of the voice activity is not taken into consideration. In this paper, we explore a hybrid SVM/hidden Markov model (HMM) approach for the VAD, which retains discriminative and nonlinear properties of SVM, while modeling the interframe correlation powerfully through a first-order HMM. Experimental results show the significant improvement of the performance of the proposed VAD in comparison with the SVM-based VAD.
  • Keywords
    hidden Markov models; maximum likelihood estimation; speech processing; support vector machines; HMM; LRs; SVM; VAD; hidden Markov model; likelihood ratios; speech detection; statistical model-based voice activity detection; support vector machine; Correlation; Hidden Markov models; Signal to noise ratio; Speech; Speech enhancement; Support vector machines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks (IJCNN), 2014 International Joint Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4799-6627-1
  • Type

    conf

  • DOI
    10.1109/IJCNN.2014.6889403
  • Filename
    6889403