DocumentCode :
2333060
Title :
Latent Dirichlet Decomposition for Single Channel Speaker Separation
Author :
Raj, Bhiksha ; Shashanka, Madhusudana V S ; Smaragdis, Paris
Author_Institution :
Mitsubishi Electr. Res. Labs., Cambridge, MA
Volume :
5
fYear :
2006
fDate :
14-19 May 2006
Abstract :
We present an algorithm for the separation of multiple speakers from mixed single-channel recordings by latent variable decomposition of the speech spectrogram. We model each magnitude spectral vector in the short-time Fourier transform of a speech signal as the outcome of a discrete random process that generates frequency bin indices. The distribution of the process is modeled as a mixture of multinomial distributions, such that the mixture weights of the component multinomials vary from analysis window to analysis window. The component multinomials are assumed to be speaker specific and are learned from training signals for each speaker. We model the prior distribution of the mixture weights for each speaker as a Dirichlet distribution. The distributions representing magnitude spectral vectors for the mixed signal are decomposed into mixtures of the multinomials for all component speakers. The frequency distribution, i.e., the spectrum for each speaker, is reconstructed from this decomposition.
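The decomposition described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the speaker-specific multinomials are already learned and stored as columns of NumPy arrays, it estimates the per-frame mixture weights by plain maximum-likelihood EM, and it leaves out the Dirichlet prior on the weights. The names `separate`, `bases`, `n_iter`, and `eps` are hypothetical.

```python
import numpy as np

def separate(V, bases, n_iter=50, eps=1e-12):
    """Split a mixture magnitude spectrogram among speakers (illustrative only).

    V     : (F, T) non-negative magnitude spectrogram of the mixed signal.
    bases : list of (F, K_s) arrays, one per speaker; each column is a
            multinomial over frequency bins (sums to 1), assumed pre-trained.
    Returns a list of (F, T) spectrogram estimates, one per speaker.
    """
    W = np.concatenate(bases, axis=1)        # all component multinomials, (F, K)
    K, T = W.shape[1], V.shape[1]
    H = np.full((K, T), 1.0 / K)             # per-frame mixture weights

    for _ in range(n_iter):
        R = W @ H + eps                      # model distribution over bins per frame
        # Combined E and M step: reweight each component by the spectral
        # mass it explains, then renormalise the weights within each frame.
        H = H * (W.T @ (V / R))
        H /= H.sum(axis=0, keepdims=True) + eps

    R = W @ H + eps
    estimates, start = [], 0
    for B in bases:
        k = B.shape[1]
        share = (B @ H[start:start + k]) / R  # fraction of each bin owned by this speaker
        estimates.append(V * share)
        start += k
    return estimates
```

Each returned estimate is the mixture spectrogram scaled by the share of the model distribution attributed to that speaker's multinomials, so the estimates sum to the mixture up to the small `eps` term.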
Keywords :
Fourier transforms; random processes; speech processing; discrete random process; frequency distribution; latent Dirichlet decomposition; magnitude spectral vector; mixed single-channel recordings; multinomial distributions; short-time Fourier transform; single channel speaker separation; speech signal; speech spectrogram; Auditory system; Signal generators; Signal processing; Spectrogram; Speech analysis; Time frequency analysis; Training data
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location :
Toulouse
ISSN :
1520-6149
Print_ISBN :
1-4244-0469-X
Type :
conf
DOI :
10.1109/ICASSP.2006.1661402
Filename :
1661402