DocumentCode :
1187231
Title :
Permutation inconsistency in blind speech separation: investigation and solutions
Author :
Ikram, Muhammad Z. ; Morgan, Dennis R.
Author_Institution :
DSP Solutions R&D Center, Texas Instrum. Inc., Dallas, TX, USA
Volume :
13
Issue :
1
fYear :
2005
Firstpage :
1
Lastpage :
13
Abstract :
Acoustic reverberation severely limits the performance of multiple microphone blind speech separation (BSS) methods. We show that the limited performance is due to random permutations of the unmixing filters over frequency. This problem, which we refer to as permutation inconsistency, becomes worse as the length of the room impulse response increases. We explore interesting connections between BSS and ideal beamforming, which leads us to propose a permutation alignment scheme based on microphone array directivity patterns. Given that the permutations are properly aligned, we show that the blind speech separation method outperforms the nonblind beamformer in a highly reverberant environment. Furthermore, we discover the tradeoff where permutations can be aligned by affording a loss in spectral resolution of the unmixing filters. We then propose a multistage algorithm, which aligns the unmixing filter permutations without sacrificing the spectral resolution. For our study, we perform experiments in both real and simulated environments and compare the results to the ideal performance benchmarks that we derive using prior knowledge of the mixing filters.
Keywords :
blind source separation; filters; reverberation; speech processing; transient response; acoustic reverberation; ideal beamforming; microphone array directivity pattern; multiple microphone blind speech separation; multistage algorithm; permutation alignment scheme; permutation inconsistency; random permutation; room impulse response; spectral resolution; unmixing filter; unmixing filter permutation; Acoustics; Array signal processing; Filters; Frequency; Helium; Microphone arrays; Reverberation; Signal resolution; Source separation; Speech enhancement;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/TSA.2004.834441
Filename :
1369307
Link To Document :
بازگشت