Title :
Voice activity detection in the presence of breathing noise using neural network and hidden Markov model
Author :
Myllymaki, Mikko ; Virtanen, Tuomas
Author_Institution :
Dept. of Signal Process., Tampere Univ. of Technol., Tampere, Finland
Abstract :
This paper proposes a voice activity detection algorithm to be used in the presence of breathing noise. We use a hybrid approach where a neural network is first applied in individual frames using mel-band energies within the frame as inputs. The output of the neural network is then processed using a hidden Markov model, which takes into account the temporally continuous nature of speech activity. Both the neural network and the hidden Markov model can be trained in supervised manner. On simulations with realistic acoustic material, the proposed method achieved average frame-level sensitivity above 97% and average specificity above 95%. The proposed algorithm enables a good rejection of noise and breathing frames while retaining the intelligibility of input speech.
Keywords :
hidden Markov models; neural nets; speech intelligibility; acoustic material; breathing frames; breathing noise; frame-level sensitivity; hidden Markov model; hybrid approach; input speech intelligibility; mel-band energy; neural network; speech activity; supervised manner; voice activity detection; Artificial neural networks; Hidden Markov models; Noise; Sensitivity; Signal processing algorithms; Speech; Speech processing;
Conference_Titel :
Signal Processing Conference, 2008 16th European
Conference_Location :
Lausanne