DocumentCode :
730113
Title :
Source counting in speech mixtures by nonparametric Bayesian estimation of an infinite Gaussian mixture model
Author :
Walter, Oliver ; Drude, Lukas ; Haeb-Umbach, Reinhold
Author_Institution :
Dept. of Commun. Eng., Univ. of Paderborn, Paderborn, Germany
fYear :
2015
fDate :
19-24 April 2015
Firstpage :
459
Lastpage :
463
Abstract :
In this paper we present a source counting algorithm to determine the number of speakers in a speech mixture. In our proposed method, we model the histogram of estimated directions of arrival with a non-parametric Bayesian infinite Gaussian mixture model. As an alternative to classical model selection criteria and to avoid specifying the maximum number of mixture components in advance, a Dirichlet process prior is employed over the mixture components. This allows to automatically determine the optimal number of mixture components that most probably model the observations. We demonstrate by experiments that this model outperforms a parametric approach using a finite Gaussian mixture model with a Dirichlet distribution prior over the mixture weights.
Keywords :
direction-of-arrival estimation; speaker recognition; speech processing; Dirichlet distribution; Dirichlet process; directions of arrival; infinite Gaussian mixture model; nonparametric Bayesian estimation; source counting algorithm; speakers; speech mixtures; Direction-of-arrival estimation; Gaussian mixture model; Microphones; Mixture models; Signal to noise ratio; Speech; Blind source separation; Chinese restaurant process; Source counting; nonparametric Bayesian methods;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
Type :
conf
DOI :
10.1109/ICASSP.2015.7178011
Filename :
7178011
Link To Document :
بازگشت