• DocumentCode
    697908
  • Title

    An approach to under-determined speech separation based on a non-linear mixture of beamformers

  • Author

    Dmour, Mohammad A. ; Davies, Michael E.

  • Author_Institution
    Inst. for Digital Commun., Univ. of Edinburgh, Edinburgh, UK
  • fYear
    2009
  • fDate
    24-28 Aug. 2009
  • Firstpage
    1452
  • Lastpage
    1456
  • Abstract
    This paper describes frequency-domain non-linear beamformers that can extract a target speech source from among multiple interfering speech sources when there are fewer microphones than sources (the under-determined case). Our approach models the data in each frequency bin via Gaussian mixture distributions, which can be learnt using the expectation maximisation (EM) algorithm. A non-linear beamformer is then developed, based on this model. The proposed non-linear beamformer is a non-linear weighted sum of linear minimum mean square error (MMSE) or minimum variance distortionless response (MVDR) beamformers. The resulting beamformer requires the direction of arrival of the target speech source to be known in advance, but the number of interferers does not need to be known or estimated. Simulations of the non-linear beamformers in under-determined mixtures with room reverberation confirm its capability to successfully separate speech sources.
  • Keywords
    Gaussian distribution; array signal processing; direction-of-arrival estimation; expectation-maximisation algorithm; frequency-domain analysis; least mean squares methods; microphones; mixture models; reverberation; source separation; speech processing; EM algorithm; Gaussian mixture distributions; MVDR beamformer nonlinear mixture; direction-of-arrival estimation; expectation maximisation algorithm; frequency bin; frequency-domain analysis; linear MMSE nonlinear weighted sum; linear minimum mean square error; microphones; minimum variance distortionless response beamformer; room reverberation; speech source extraction; speech source separation; Frequency-domain analysis; Interference; Mathematical model; Microphones; Microwave integrated circuits; Speech; Speech processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference, 2009 17th European
  • Conference_Location
    Glasgow
  • Print_ISBN
    978-161-7388-76-7
  • Type

    conf

  • Filename
    7077480