DocumentCode
2148302
Title
A non-negative approach to semi-supervised separation of speech from noise with the use of temporal dynamics
Author
Mysore, Gautham J. ; Smaragdis, Paris
fYear
2011
fDate
22-27 May 2011
Firstpage
17
Lastpage
20
Abstract
We present a semi-supervised source separation methodology to denoise speech by modeling speech as one source and noise as the other source. We model speech using the recently pro posed non-negative hidden Markov model, which uses multiple non-negative dictionaries and a Markov chain to jointly model spectral structure and temporal dynamics of speech. We perform separation of the speech and noise using the recently proposed non-negative factorial hidden Markov model. Although the speech model is learned from training data, the noise model is learned during the separation process and re quires no training data. We show that the proposed method achieves superior results to using non-negative spectrogram factorization, which ignores the non-stationarity and temporal dynamics of speech.
Keywords
hidden Markov models; signal denoising; source separation; speech processing; Markov chain; multiple nonnegative dictionaries; nonnegative approach; nonnegative factorial hidden Markov model; nonnegative hidden Markov model; semisupervised separation; semisupervised source separation; spectral structure; speech denoising; speech modeling; temporal dynamics; Dictionaries; Hidden Markov models; Noise; Noise reduction; Source separation; Spectrogram; Speech; Denoising; Semi-supervised source separation;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location
Prague
ISSN
1520-6149
Print_ISBN
978-1-4577-0538-0
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2011.5946317
Filename
5946317
Link To Document