DocumentCode :
697908
Title :
An approach to under-determined speech separation based on a non-linear mixture of beamformers
Author :
Dmour, Mohammad A. ; Davies, Michael E.
Author_Institution :
Inst. for Digital Commun., Univ. of Edinburgh, Edinburgh, UK
fYear :
2009
fDate :
24-28 Aug. 2009
Firstpage :
1452
Lastpage :
1456
Abstract :
This paper describes frequency-domain non-linear beamformers that can extract a target speech source from among multiple interfering speech sources when there are fewer microphones than sources (the under-determined case). Our approach models the data in each frequency bin via Gaussian mixture distributions, which can be learnt using the expectation maximisation (EM) algorithm. A non-linear beamformer is then developed, based on this model. The proposed non-linear beamformer is a non-linear weighted sum of linear minimum mean square error (MMSE) or minimum variance distortionless response (MVDR) beamformers. The resulting beamformer requires the direction of arrival of the target speech source to be known in advance, but the number of interferers does not need to be known or estimated. Simulations of the non-linear beamformers in under-determined mixtures with room reverberation confirm their capability to successfully separate speech sources.
Keywords :
Gaussian distribution; array signal processing; direction-of-arrival estimation; expectation-maximisation algorithm; frequency-domain analysis; least mean squares methods; microphones; mixture models; reverberation; source separation; speech processing; EM algorithm; Gaussian mixture distributions; MVDR beamformer nonlinear mixture; frequency bin; linear MMSE nonlinear weighted sum; linear minimum mean square error; minimum variance distortionless response beamformer; room reverberation; speech source extraction; speech source separation; interference; mathematical model; microwave integrated circuits; speech
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Signal Processing Conference, 2009 17th European
Conference_Location :
Glasgow
Print_ISBN :
978-1-61738-876-7
Type :
conf
Filename :
7077480