DocumentCode
2881058
Title
A new speaker change detection method for two-speaker segmentation
Author
Adam, Andre G. ; Kajarekar, Sachin S. ; Hermansky, Hynek
Author_Institution
OGI School of Science and Engineering, Oregon Health and Science University, Portland, USA
Volume
4
fYear
2002
fDate
13-17 May 2002
Abstract
In absence of prior information about speakers, an important step in speaker segmentation is to obtain initial estimates for training speaker models. In this paper, we present a new method for obtaining these estimates. The method assumes that a conversation must be initiated by one of the speakers. Thus one speaker model is estimated from the small segment at the beginning of the conversation and the segment that has the largest distance from the initial segment is used to train second speaker model. We describe a system based on this method and evaluate it on two different tasks: a controlled task with variations in the duration of the initial speaker segment and amount of overlapped speech and 2001 NIST Speaker Recognition Evaluation task that contains natural conversations. This system shows significant improvements over the conventional system in absence of overlapped speech on the controlled task.
Keywords
Bayesian methods; Cepstral analysis; Computational modeling; Databases; NIST; Silicon; Switches;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
Conference_Location
Orlando, FL, USA
ISSN
1520-6149
Print_ISBN
0-7803-7402-9
Type
conf
DOI
10.1109/ICASSP.2002.5745511
Filename
5745511
Link To Document