DocumentCode :
2148302
Title :
A non-negative approach to semi-supervised separation of speech from noise with the use of temporal dynamics
Author :
Mysore, Gautham J. ; Smaragdis, Paris
fYear :
2011
fDate :
22-27 May 2011
Firstpage :
17
Lastpage :
20
Abstract :
We present a semi-supervised source separation methodology to denoise speech by modeling speech as one source and noise as the other source. We model speech using the recently pro posed non-negative hidden Markov model, which uses multiple non-negative dictionaries and a Markov chain to jointly model spectral structure and temporal dynamics of speech. We perform separation of the speech and noise using the recently proposed non-negative factorial hidden Markov model. Although the speech model is learned from training data, the noise model is learned during the separation process and re quires no training data. We show that the proposed method achieves superior results to using non-negative spectrogram factorization, which ignores the non-stationarity and temporal dynamics of speech.
Keywords :
hidden Markov models; signal denoising; source separation; speech processing; Markov chain; multiple nonnegative dictionaries; nonnegative approach; nonnegative factorial hidden Markov model; nonnegative hidden Markov model; semisupervised separation; semisupervised source separation; spectral structure; speech denoising; speech modeling; temporal dynamics; Dictionaries; Hidden Markov models; Noise; Noise reduction; Source separation; Spectrogram; Speech; Denoising; Semi-supervised source separation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
ISSN :
1520-6149
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2011.5946317
Filename :
5946317
Link To Document :
بازگشت