• DocumentCode
    730113
  • Title

    Source counting in speech mixtures by nonparametric Bayesian estimation of an infinite Gaussian mixture model

  • Author

    Walter, Oliver ; Drude, Lukas ; Haeb-Umbach, Reinhold

  • Author_Institution
    Dept. of Commun. Eng., Univ. of Paderborn, Paderborn, Germany
  • fYear
    2015
  • fDate
    19-24 April 2015
  • Firstpage
    459
  • Lastpage
    463
  • Abstract
    In this paper we present a source counting algorithm to determine the number of speakers in a speech mixture. In our proposed method, we model the histogram of estimated directions of arrival with a non-parametric Bayesian infinite Gaussian mixture model. As an alternative to classical model selection criteria and to avoid specifying the maximum number of mixture components in advance, a Dirichlet process prior is employed over the mixture components. This allows to automatically determine the optimal number of mixture components that most probably model the observations. We demonstrate by experiments that this model outperforms a parametric approach using a finite Gaussian mixture model with a Dirichlet distribution prior over the mixture weights.
  • Keywords
    direction-of-arrival estimation; speaker recognition; speech processing; Dirichlet distribution; Dirichlet process; directions of arrival; infinite Gaussian mixture model; nonparametric Bayesian estimation; source counting algorithm; speakers; speech mixtures; Direction-of-arrival estimation; Gaussian mixture model; Microphones; Mixture models; Signal to noise ratio; Speech; Blind source separation; Chinese restaurant process; Source counting; nonparametric Bayesian methods;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
  • Conference_Location
    South Brisbane, QLD
  • Type

    conf

  • DOI
    10.1109/ICASSP.2015.7178011
  • Filename
    7178011