DocumentCode
1797326
Title
Hybrid SVM/HMM architectures for statistical model-based voice activity detection
Author
Ying-Wei Tan ; Wen-Ju Liu ; Wei Jiang ; Hao Zheng
Author_Institution
Dept. of Nat. Lab. of Pattern Recognition, Inst. of Autom., Beijing, China
fYear
2014
fDate
6-11 July 2014
Firstpage
2875
Lastpage
2878
Abstract
The decision function of support vector machine (SVM) using the likelihood ratios (LRs) is successfully used for statistical model-based voice activity detection (VAD). It is known to incorporate an optimised nonlinear decision over two different classes, instead of comparing the geometric mean of the LRs for the individual frequency bands with a given threshold for speech detection. However, the inter-frame correlation of the voice activity is not taken into consideration. In this paper, we explore a hybrid SVM/hidden Markov model (HMM) approach for the VAD, which retains discriminative and nonlinear properties of SVM, while modeling the interframe correlation powerfully through a first-order HMM. Experimental results show the significant improvement of the performance of the proposed VAD in comparison with the SVM-based VAD.
Keywords
hidden Markov models; maximum likelihood estimation; speech processing; support vector machines; HMM; LRs; SVM; VAD; hidden Markov model; likelihood ratios; speech detection; statistical model-based voice activity detection; support vector machine; Correlation; Hidden Markov models; Signal to noise ratio; Speech; Speech enhancement; Support vector machines;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks (IJCNN), 2014 International Joint Conference on
Conference_Location
Beijing
Print_ISBN
978-1-4799-6627-1
Type
conf
DOI
10.1109/IJCNN.2014.6889403
Filename
6889403
Link To Document