Title :
Two-stage underdetermined speech source separation using frequency normalization
Author :
Reddy, V.V. ; Sattar, F. ; Ng, B.P. ; Driessen, P.F.
Author_Institution :
Sch. of Electr. & Electron. Eng., Nanyang Technol. Univ., Singapore, Singapore
Abstract :
In this paper, we consider the problem of underdetermined blind source separation for anechoic speech recordings. Existing two-stage methods which first estimate mixing matrix and then separate sources are suitable only for instantaneous mixtures and do not cater for anechoic speech recordings. Other time-frequency (TF) methods based on binary masks are found to have limited performance. We here propose a new two-stage technique which includes frequency normalization to estimate mixing matrix followed by a source separation stage involving denormalization process to estimate frequency dependent mixing matrices. Experimental results are provided to demonstrate the advantage of the proposed method over other methods.
Keywords :
blind source separation; estimation theory; matrix algebra; recording; speech processing; time-frequency analysis; anechoic speech recording; binary mask; denormalization process; frequency dependent mixing matrix estimation; frequency normalization; time-frequency method; two-stage underdetermined speech source separation; underdetermined blind source separation; Arrays; Estimation; Frequency estimation; Speech; Time frequency analysis; Vectors;
Conference_Titel :
Communications, Computers and Signal Processing (PacRim), 2011 IEEE Pacific Rim Conference on
Conference_Location :
Victoria, BC
Print_ISBN :
978-1-4577-0252-5
Electronic_ISBN :
1555-5798
DOI :
10.1109/PACRIM.2011.6032970