DocumentCode
417170
Title
Speaker indexing and adaptation using speaker clustering based on statistical model selection
Author
Nishida, Masafumi ; Kawahara, Tatsuya
Author_Institution
Graduate Sch. of Sci. & Technol., Chiba Univ., Japan
Volume
1
fYear
2004
fDate
17-21 May 2004
Abstract
The paper addresses unsupervised speaker indexing and automatic speech recognition of discussions. In speaker indexing, there are two cases, where the number of speakers is unknown beforehand and where the number is known. When the specified number is unknown, it is difficult to apply to various data because it needs to determine several parameters like threshold. In addition, serious problems arise in applying a uniform model because variations in the utterance durations of speakers are large. We thus propose a method which can robustly perform speaker indexing for the two cases using a flexible framework in which an optimal speaker model (GMM or VQ) is selected based on the BIC (Bayesian information criterion). Moreover, we propose a combination method of speaker adaptation based on speaker selection and the indexing method. For real discussion archives, we demonstrated that indexing performance is higher than that of conventional methods for the two cases and speech recognition performance was improved by the combination method.
Keywords
Bayes methods; Gaussian processes; signal classification; speaker recognition; speech recognition; statistical analysis; vector quantisation; BIC; Bayesian information criterion; GMM; VQ; automatic speech recognition; optimal speaker model; speaker adaptation; speaker clustering; statistical model selection; unsupervised speaker indexing; utterance duration; Acoustic testing; Automatic speech recognition; Bayesian methods; Gaussian distribution; Indexing; Informatics; Loudspeakers; Robustness; Speech recognition; Voice mail;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-8484-9
Type
conf
DOI
10.1109/ICASSP.2004.1325995
Filename
1325995
Link To Document