DocumentCode :
3348313
Title :
Co-channel audiovisual speech separation using spectral matching constraints
Author :
Dansereau, R.M.
Author_Institution :
Dept. of Syst. & Comput. Eng., Carleton Univ., Ottawa, Ont., Canada
Volume :
5
fYear :
2004
fDate :
17-21 May 2004
Abstract :
In this paper, the problem of co-channel speech separation for convolutive mixtures is considered where visual cues from one of the speakers is available as side information. The visual cues from the one speaker in the two speaker speech separation are used to estimate the spectral content of the speech and this spectral estimate is in turn used to constrain the solution of the coupling reconstruction filters in the convolutive mixture. The preliminary experimental results show that good performance in speech separation is obtained for our limited case study of visual cues obtained from the spoken numbers of "one" thru "four".
Keywords :
convolution; decorrelation; feature extraction; hidden Markov models; signal reconstruction; source separation; speech processing; video signal processing; co-channel audiovisual speech separation; continuous density HMM; convolutive speech mixtures; coupling reconstruction filters; decorrelation filters; left-right hidden Markov model; lip feature extraction; source signal reconstruction; spectral matching constraints; speech spectral content; two speaker speech separation; visual cue side information; Decorrelation; Drives; Filters; Lips; Loudspeakers; Microphones; Motion estimation; Source separation; Speech; Systems engineering and theory;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-8484-9
Type :
conf
DOI :
10.1109/ICASSP.2004.1327193
Filename :
1327193
Link To Document :
بازگشت