DocumentCode
697908
Title
An approach to under-determined speech separation based on a non-linear mixture of beamformers
Author
Dmour, Mohammad A. ; Davies, Michael E.
Author_Institution
Inst. for Digital Commun., Univ. of Edinburgh, Edinburgh, UK
fYear
2009
fDate
24-28 Aug. 2009
Firstpage
1452
Lastpage
1456
Abstract
This paper describes frequency-domain non-linear beamformers that can extract a target speech source from among multiple interfering speech sources when there are fewer microphones than sources (the under-determined case). Our approach models the data in each frequency bin via Gaussian mixture distributions, which can be learnt using the expectation maximisation (EM) algorithm. A non-linear beamformer is then developed based on this model. The proposed non-linear beamformer is a non-linear weighted sum of linear minimum mean square error (MMSE) or minimum variance distortionless response (MVDR) beamformers. The resulting beamformer requires the direction of arrival of the target speech source to be known in advance, but the number of interferers does not need to be known or estimated. Simulations of the non-linear beamformers in under-determined mixtures with room reverberation confirm their capability to successfully separate speech sources.
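The idea of a non-linear weighted sum of linear beamformers can be illustrated with a minimal sketch for a single frequency bin. This is one plausible reading of the abstract, not the authors' implementation: it assumes a zero-mean complex Gaussian mixture whose priors and covariances have already been learnt (the EM step is omitted), uses the component posteriors as the non-linear weights, and uses the MVDR variant only. The function names `mvdr_weights`, `log_complex_gaussian` and `mixture_beamformer` are hypothetical.

```python
import numpy as np


def mvdr_weights(R, d):
    """MVDR weights w = R^{-1} d / (d^H R^{-1} d) for covariance R, steering vector d."""
    Rinv_d = np.linalg.solve(R, d)
    return Rinv_d / (d.conj() @ Rinv_d)


def log_complex_gaussian(x, R):
    """Log-density of a zero-mean circular complex Gaussian CN(0, R) at x."""
    M = x.shape[0]
    _, logdet = np.linalg.slogdet(R)
    quad = np.real(x.conj() @ np.linalg.solve(R, x))
    return -M * np.log(np.pi) - logdet - quad


def mixture_beamformer(x, priors, covs, d):
    """Posterior-weighted sum of per-component MVDR outputs for one snapshot x.

    Assumption: priors and covs describe an already-fitted mixture model
    for this frequency bin; d is the known target steering vector.
    """
    log_post = np.array([np.log(p) + log_complex_gaussian(x, R)
                         for p, R in zip(priors, covs)])
    log_post -= log_post.max()          # stabilise before exponentiating
    post = np.exp(log_post)
    post /= post.sum()                  # posterior probability of each component
    # Each component contributes its own linear MVDR estimate w_k^H x.
    outputs = np.array([mvdr_weights(R, d).conj() @ x for R in covs])
    return post @ outputs, post
```

Because every MVDR component is distortionless towards `d` and the posteriors sum to one, a snapshot containing only the target (`x = s * d`) is passed through unchanged in this sketch, whatever the posterior weights are. The weights depend on `x` itself, which is what makes the overall mapping non-linear.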
Keywords
Gaussian distribution; array signal processing; direction-of-arrival estimation; expectation-maximisation algorithm; frequency-domain analysis; least mean squares methods; microphones; mixture models; reverberation; source separation; speech processing; EM algorithm; Gaussian mixture distributions; MVDR beamformer nonlinear mixture; direction-of-arrival estimation; expectation maximisation algorithm; frequency bin; frequency-domain analysis; linear MMSE nonlinear weighted sum; linear minimum mean square error; microphones; minimum variance distortionless response beamformer; room reverberation; speech source extraction; speech source separation; Frequency-domain analysis; Interference; Mathematical model; Microphones; Microwave integrated circuits; Speech; Speech processing;
fLanguage
English
Publisher
IEEE
Conference_Titel
Signal Processing Conference, 2009 17th European
Conference_Location
Glasgow
Print_ISBN
978-161-7388-76-7
Type
conf
Filename
7077480