DocumentCode
3348313
Title
Co-channel audiovisual speech separation using spectral matching constraints
Author
Dansereau, R.M.
Author_Institution
Dept. of Syst. & Comput. Eng., Carleton Univ., Ottawa, Ont., Canada
Volume
5
fYear
2004
fDate
17-21 May 2004
Abstract
In this paper, the problem of co-channel speech separation for convolutive mixtures is considered where visual cues from one of the speakers are available as side information. The visual cues from one speaker in the two-speaker separation problem are used to estimate the spectral content of that speaker's speech, and this spectral estimate is in turn used to constrain the solution of the coupling reconstruction filters in the convolutive mixture. Preliminary experimental results show that good speech separation performance is obtained for our limited case study of visual cues obtained from the spoken numbers "one" through "four".
Keywords
convolution; decorrelation; feature extraction; hidden Markov models; signal reconstruction; source separation; speech processing; video signal processing; co-channel audiovisual speech separation; continuous density HMM; convolutive speech mixtures; coupling reconstruction filters; decorrelation filters; left-right hidden Markov model; lip feature extraction; source signal reconstruction; spectral matching constraints; speech spectral content; two speaker speech separation; visual cue side information; Decorrelation; Drives; Filters; Lips; Loudspeakers; Microphones; Motion estimation; Source separation; Speech; Systems engineering and theory;
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-8484-9
Type
conf
DOI
10.1109/ICASSP.2004.1327193
Filename
1327193