DocumentCode
395197
Title
Speaker detection using multi-speaker audio files for both enrollment and test
Author
Bonastre, Jean-François ; Meignier, Sylvuin ; Merlin, Tevu
Author_Institution
LIA-Avignon, Avignon, France
Volume
2
fYear
2003
fDate
6-10 April 2003
Abstract
This paper focuses on speaker detection using multispeaker files both for the enrollment phase and for the test phase. This task was introduced during the 2002 NIST speaker recognition evaluation campaign. Enrollment data is composed of three two-speaker files. Test files are also two-speaker records. The system presented here uses a speaker segmentation process based on an HMM conversation model followed by a speaker matching technique to produce one-speaker segments. Speaker detection is then achieved using AMIRAL, LIA´s GMM-based speaker verification system. Validation of the proposed strategy is done using extracts from the NIST 2002 results.
Keywords
hidden Markov models; speaker recognition; 2002 NIST speaker recognition evaluation; AMIRAL; HMM conversation model; LIA GMM-based speaker verification system; enrollment data; multispeaker audio files; one-speaker segments; speaker detection; speaker matching technique; speaker segmentation; test data; two-speaker files; two-speaker records; Data mining; Hidden Markov models; Iterative decoding; Loudspeakers; NIST; Phase detection; Speaker recognition; Speech; System testing; Viterbi algorithm;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-7663-3
Type
conf
DOI
10.1109/ICASSP.2003.1202298
Filename
1202298
Link To Document