DocumentCode :
700052
Title :
Voice activity detection in the presence of breathing noise using neural network and hidden Markov model
Author :
Myllymaki, Mikko ; Virtanen, Tuomas
Author_Institution :
Dept. of Signal Process., Tampere Univ. of Technol., Tampere, Finland
fYear :
2008
fDate :
25-29 Aug. 2008
Firstpage :
1
Lastpage :
5
Abstract :
This paper proposes a voice activity detection algorithm to be used in the presence of breathing noise. We use a hybrid approach where a neural network is first applied in individual frames using mel-band energies within the frame as inputs. The output of the neural network is then processed using a hidden Markov model, which takes into account the temporally continuous nature of speech activity. Both the neural network and the hidden Markov model can be trained in supervised manner. On simulations with realistic acoustic material, the proposed method achieved average frame-level sensitivity above 97% and average specificity above 95%. The proposed algorithm enables a good rejection of noise and breathing frames while retaining the intelligibility of input speech.
Keywords :
hidden Markov models; neural nets; speech intelligibility; acoustic material; breathing frames; breathing noise; frame-level sensitivity; hidden Markov model; hybrid approach; input speech intelligibility; mel-band energy; neural network; speech activity; supervised manner; voice activity detection; Artificial neural networks; Hidden Markov models; Noise; Sensitivity; Signal processing algorithms; Speech; Speech processing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing Conference, 2008 16th European
Conference_Location :
Lausanne
ISSN :
2219-5491
Type :
conf
Filename :
7080584
Link To Document :
بازگشت