DocumentCode :
2575278
Title :
A probabilistic speaker clustering for DOA-based diarization
Author :
Ishiguro, Katsuhiko ; Yamada, Takeshi ; Araki, Shoko ; Nakatani, Tomohiro
Author_Institution :
NTT Commun. Sci. Labs., Seika, Japan
fYear :
2009
fDate :
18-21 Oct. 2009
Firstpage :
241
Lastpage :
244
Abstract :
We present a probabilistic speaker clustering and diarization model. Speaker diarization determines "who spoke when" from a recorded conversation among an unknown number of people. We formulate this problem as the clustering of sequential auditory features generated by an unknown number of latent mixture components (speakers). We employ a probabilistic model that automatically estimates the number of speakers and the time-varying speaker proportions. Experiments with synthetic and real sound recordings confirm that the proposed model can successfully infer the number and features of speakers and obtains better speaker diarization results than conventional models.
Keywords :
audio recording; direction-of-arrival estimation; probability; speaker recognition; time-varying systems; DOA-based diarization; audio recordings; conversation recording; probabilistic model; probabilistic speaker clustering; speaker recognition; time-varying speaker proportion; Acoustic applications; Acoustic signal processing; Conferences; Direction of arrival estimation; Feature extraction; Laboratories; Loudspeakers; Microphone arrays; Speech; Working environment noise; Probabilistic clustering; direction of arrival; speaker diarization; variational Bayes;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Applications of Signal Processing to Audio and Acoustics, 2009. WASPAA '09. IEEE Workshop on
Conference_Location :
New Paltz, NY
ISSN :
1931-1168
Print_ISBN :
978-1-4244-3678-1
Electronic_ISBN :
1931-1168
Type :
conf
DOI :
10.1109/ASPAA.2009.5346517
Filename :
5346517