• DocumentCode
    3348313
  • Title

    Co-channel audiovisual speech separation using spectral matching constraints

  • Author

    Dansereau, R.M.

  • Author_Institution
    Dept. of Syst. & Comput. Eng., Carleton Univ., Ottawa, Ont., Canada
  • Volume
    5
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    In this paper, the problem of co-channel speech separation for convolutive mixtures is considered where visual cues from one of the speakers is available as side information. The visual cues from the one speaker in the two speaker speech separation are used to estimate the spectral content of the speech and this spectral estimate is in turn used to constrain the solution of the coupling reconstruction filters in the convolutive mixture. The preliminary experimental results show that good performance in speech separation is obtained for our limited case study of visual cues obtained from the spoken numbers of "one" thru "four".
  • Keywords
    convolution; decorrelation; feature extraction; hidden Markov models; signal reconstruction; source separation; speech processing; video signal processing; co-channel audiovisual speech separation; continuous density HMM; convolutive speech mixtures; coupling reconstruction filters; decorrelation filters; left-right hidden Markov model; lip feature extraction; source signal reconstruction; spectral matching constraints; speech spectral content; two speaker speech separation; visual cue side information; Decorrelation; Drives; Filters; Lips; Loudspeakers; Microphones; Motion estimation; Source separation; Speech; Systems engineering and theory;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2004.1327193
  • Filename
    1327193