DocumentCode :
542324
Title :
Blind separation of non-linear convolved speech mixtures
Author :
Koutras, Athanaios
Author_Institution :
WCL, Electrical and Computer Engineering, University of Patras, Greece
Volume :
1
fYear :
2002
fDate :
13-17 May 2002
Abstract :
In this paper we present a novel solution to the convolutive and post non-linear Blind Speech Separation (NLBSS) problem based on a neural network topology. The non-linear separating functions are chosen to be a mixture of parametric sigmoid functions. The estimation of the separating filter coefficients and the parameters of the separating non-linear functions is derived using the Maximum Likelihood Estimation principle. Extensive experiments using complex non-linear mixing functions and real room impulse responses were carried out to simulate a mixing scenario of two simultaneous speakers in a real room environment under the effect of high non-linear distortions. The proposed method succeeds in separating the non-linearly mixed signals and improves the phoneme recognition accuracy of an automatic speech recognition system by more than 20% in comparison to the accuracy measured with the non-linear mixture signals. Furthermore, this method outperforms standard linear ESS methods by 20%, justifying the necessity for non-linear separating functions integration in speech recognition systems in a non-linear and multi-simultaneous speaker environment.
Keywords :
Discrete wavelet transforms; Finite impulse response filter; Noise measurement; Nonlinear distortion; Speech; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
Conference_Location :
Orlando, FL, USA
ISSN :
1520-6149
Print_ISBN :
0-7803-7402-9
Type :
conf
DOI :
10.1109/ICASSP.2002.5743888
Filename :
5743888
Link To Document :
بازگشت