Speaker detection using multi-speaker audio files for both enrollment and test

Author

Bonastre, Jean-François ; Meignier, Sylvuin ; Merlin, Tevu

Author_Institution

LIA-Avignon, Avignon, France

Volume

2

fYear

2003

fDate

6-10 April 2003

Abstract

This paper focuses on speaker detection using multispeaker files both for the enrollment phase and for the test phase. This task was introduced during the 2002 NIST speaker recognition evaluation campaign. Enrollment data is composed of three two-speaker files. Test files are also two-speaker records. The system presented here uses a speaker segmentation process based on an HMM conversation model followed by a speaker matching technique to produce one-speaker segments. Speaker detection is then achieved using AMIRAL, LIA´s GMM-based speaker verification system. Validation of the proposed strategy is done using extracts from the NIST 2002 results.

Keywords

hidden Markov models; speaker recognition; 2002 NIST speaker recognition evaluation; AMIRAL; HMM conversation model; LIA GMM-based speaker verification system; enrollment data; multispeaker audio files; one-speaker segments; speaker detection; speaker matching technique; speaker segmentation; test data; two-speaker files; two-speaker records; Data mining; Hidden Markov models; Iterative decoding; Loudspeakers; NIST; Phase detection; Speaker recognition; Speech; System testing; Viterbi algorithm;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on

ISSN

1520-6149

Print_ISBN

0-7803-7663-3

Type

conf

DOI

10.1109/ICASSP.2003.1202298

Filename

1202298