• DocumentCode
    542324
  • Title

    Blind separation of non-linear convolved speech mixtures

  • Author

    Koutras, Athanaios

  • Author_Institution
    WCL, Electrical and Computer Engineering, University of Patras, Greece
  • Volume
    1
  • fYear
    2002
  • fDate
    13-17 May 2002
  • Abstract
    In this paper we present a novel solution to the convolutive and post non-linear Blind Speech Separation (NLBSS) problem based on a neural network topology. The non-linear separating functions are chosen to be a mixture of parametric sigmoid functions. The estimation of the separating filter coefficients and the parameters of the separating non-linear functions is derived using the Maximum Likelihood Estimation principle. Extensive experiments using complex non-linear mixing functions and real room impulse responses were carried out to simulate a mixing scenario of two simultaneous speakers in a real room environment under the effect of high non-linear distortions. The proposed method succeeds in separating the non-linearly mixed signals and improves the phoneme recognition accuracy of an automatic speech recognition system by more than 20% in comparison to the accuracy measured with the non-linear mixture signals. Furthermore, this method outperforms standard linear ESS methods by 20%, justifying the necessity for non-linear separating functions integration in speech recognition systems in a non-linear and multi-simultaneous speaker environment.
  • Keywords
    Discrete wavelet transforms; Finite impulse response filter; Noise measurement; Nonlinear distortion; Speech; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
  • Conference_Location
    Orlando, FL, USA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7402-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2002.5743888
  • Filename
    5743888