DocumentCode :
1749681
Title :
Speaker change detection and speaker clustering using VQ distortion for broadcast news speech recognition
Author :
Mori, Kazumasa ; Nakagawa, Seiichi
Author_Institution :
Dept. of Inf. & Comput. Sci., Toyohashi Univ. of Technol., Aichi, Japan
Volume :
1
fYear :
2001
fDate :
2001
Firstpage :
413
Abstract :
Addresses the problem of the detection of speaker changes and clustering speakers when no information is available regarding speaker classes or even the total number of classes. We assume that no previous information on speakers is available (no speaker model, no training phase) and that people do not speak simultaneously. The aim is to apply speaker grouping information to speaker adaptation for speech recognition. We use vector quantization (VQ) distortion as the criterion. A speaker model is created from successive utterances as a codebook by a VQ algorithm, and the VQ distortion is calculated between the model and an utterance. A result was obtained by the experiment on speaker detection and speaker clustering. The speaker change detection experiment was compared with results by generalized likelihood ratio and Bayesian information criterion. We show the superiority of our proposed method
Keywords :
covariance matrices; speaker recognition; vector quantisation; Bayesian information criterion; VQ distortion; broadcast news speech recognition; codebook; generalized likelihood ratio; speaker adaptation; speaker change detection; speaker clustering; speaker grouping information; vector quantization distortion; Acoustic distortion; Bayesian methods; Broadcast technology; Broadcasting; Change detection algorithms; Clustering algorithms; Loudspeakers; Magnetooptic recording; Speech recognition; Vector quantization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location :
Salt Lake City, UT
ISSN :
1520-6149
Print_ISBN :
0-7803-7041-4
Type :
conf
DOI :
10.1109/ICASSP.2001.940855
Filename :
940855
Link To Document :
بازگشت