DocumentCode
730113
Title
Source counting in speech mixtures by nonparametric Bayesian estimation of an infinite Gaussian mixture model
Author
Walter, Oliver ; Drude, Lukas ; Haeb-Umbach, Reinhold
Author_Institution
Dept. of Commun. Eng., Univ. of Paderborn, Paderborn, Germany
fYear
2015
fDate
19-24 April 2015
Firstpage
459
Lastpage
463
Abstract
In this paper we present a source counting algorithm to determine the number of speakers in a speech mixture. In our proposed method, we model the histogram of estimated directions of arrival with a non-parametric Bayesian infinite Gaussian mixture model. As an alternative to classical model selection criteria and to avoid specifying the maximum number of mixture components in advance, a Dirichlet process prior is employed over the mixture components. This allows to automatically determine the optimal number of mixture components that most probably model the observations. We demonstrate by experiments that this model outperforms a parametric approach using a finite Gaussian mixture model with a Dirichlet distribution prior over the mixture weights.
Keywords
direction-of-arrival estimation; speaker recognition; speech processing; Dirichlet distribution; Dirichlet process; directions of arrival; infinite Gaussian mixture model; nonparametric Bayesian estimation; source counting algorithm; speakers; speech mixtures; Direction-of-arrival estimation; Gaussian mixture model; Microphones; Mixture models; Signal to noise ratio; Speech; Blind source separation; Chinese restaurant process; Source counting; nonparametric Bayesian methods;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location
South Brisbane, QLD
Type
conf
DOI
10.1109/ICASSP.2015.7178011
Filename
7178011
Link To Document