DocumentCode :
463413
Title :
Speech Source Separation by Combining Localization Cues with Mixture Models of Speech Spectra
Author :
Wilson, Keith
Author_Institution :
Mitsubishi Electr. Res. Lab., Cambridge, MA, USA
Volume :
1
fYear :
2007
fDate :
15-20 April 2007
Abstract :
We present a method for simultaneous speech source separation in reverberant environments using both localization cues and a speech model. Previous source separation work has focused primarily on one or the other of these approaches; we use a novel localization cue observation noise model to allow for a natural combination of the approaches. We model speech as a Gaussian mixture model (GMM) of short-time spectral magnitudes and model localization cue noise using a time-varying noise model learned from labeled training data. We show that our technique outperforms competing techniques as measured by segmental signal-to-noise ratio (SNR) and segmental log-spectral distortion (LSD) and also show that our technique is robust to typical levels of audio localization error.
Keywords :
Gaussian processes; noise; source separation; speech processing; Gaussian mixture model; SNR; audio localization error; localization cues; mixture models; reverberant environments; segmental log-spectral distortion; short-time spectral magnitudes; signal-to-noise ratio; speech source separation; speech spectra; time-varying noise model; Acoustic arrays; Gaussian noise; Hidden Markov models; Microphone arrays; Position measurement; Signal to noise ratio; Source separation; Spectrogram; Speech enhancement; Working environment noise; Acoustic arrays; Array signal processing; Speech enhancement; Speech processing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
ISSN :
1520-6149
Print_ISBN :
1-4244-0727-3
Type :
conf
DOI :
10.1109/ICASSP.2007.366609
Filename :
4217009
Link To Document :
بازگشت