  • DocumentCode
    2807054
  • Title
    Blind speech separation employing directional statistics in an Expectation Maximization framework
  • Author
    Vu, Dang Hai Tran ; Haeb-Umbach, Reinhold
  • Author_Institution
    Dept. of Commun. Eng., Univ. of Paderborn, Paderborn, Germany
  • fYear
    2010
  • fDate
    14-19 March 2010
  • Firstpage
    241
  • Lastpage
    244
  • Abstract
    In this paper, we propose to employ directional statistics in a complex vector space to approach the problem of blind speech separation in the presence of spatially correlated noise. We interpret the values of the short-time Fourier transform of the microphone signals as draws from a mixture of complex Watson distributions, a probabilistic model which naturally accounts for spatial aliasing. The parameters of the density are related to the a priori source probabilities, the power of the sources, and the transfer function ratios from sources to sensors. Estimation formulas for these parameters are derived by employing the Expectation Maximization (EM) algorithm. The E-step corresponds to the estimation of the source presence probabilities for each time-frequency bin, while the M-step leads to a maximum signal-to-noise ratio (MaxSNR) beamformer in the presence of uncertainty about the source activity. Experimental results are reported for an implementation in a generalized sidelobe canceller (GSC)-like spatial beamforming configuration for three speech sources with significant coherent noise in reverberant environments, demonstrating the usefulness of the novel modeling framework.
  • Keywords
    Fourier transforms; array signal processing; blind source separation; expectation-maximisation algorithm; interference suppression; speech enhancement; statistical distributions; Fourier transform; blind speech separation; complex Watson distribution; complex vector space; directional statistics; expectation maximization algorithm; generalized sidelobe canceller; maximum signal-to-noise ratio beamformer; microphone signal; probabilistic model; spatial aliasing; spatial beamforming configuration; Microphones; Signal to noise ratio; Statistics; Time frequency analysis; Transfer functions; Uncertainty; EM-Algorithm; Noisy Source Separation; Sparse Signal Separation
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-4295-9
  • Electronic_ISBN
    1520-6149
  • Type
    conf
  • DOI
    10.1109/ICASSP.2010.5495994
  • Filename
    5495994