DocumentCode
573512
Title
Improving channel robustness in text-independent speaker verification using adaptive virtual cohort models
Author
Nautsch, Andreas ; Schönwandt, Anne ; Kasper, Klaus ; Reininger, Herbert ; Wagner, Martin
Author_Institution
Dept. of Comput. Sci., Univ. of Appl. Sci. Darmstadt, Darmstadt, Germany
fYear
2012
fDate
6-7 Sept. 2012
Firstpage
1
Lastpage
5
Abstract
In speaker verification, score normalization methods are a common practice to gain better performance and robustness. One kind of score normalization is cohort normalization, which uses information about the score behaviour of known impostors. During enrolment, impostor verifications are simulated to get a speaker-specific set of the most competitive impostors (the cohort). In the present paper, one virtual cohort speaker is synthesized using the most competitive impostor´s Hidden Markov Models (HMMs). These impostors are also users of the system and therefore their models have channel-specific information contrary to the universal background model, which provides channel- and speaker-independent models. On verification, cohort scores are obtained by an additional verification of the virtual cohort speaker. The cohort scores evaluate the candidate as an impostor. A cohort normalized score promises greater robustness. This paper will study the effect of the introduced cohort normalization technique on the speaker verification system atip VoxGuard, which is based on mel-frequency cepstral coefficients and HMMs. VoxGuard can be used as either a text-dependent or a text-independent verification system. In this paper, emphasis is placed on text-independent speaker verification. Experiments using the atip speech corpus and the SieTill speech corpus showed improvements measured by the equal error rate on performance and robustness.
Keywords
cepstral analysis; hidden Markov models; performance evaluation; speaker recognition; speech synthesis; HMM; Mel-frequency cepstral coefficients; SieTill speech corpus; adaptive virtual cohort models; atip VoxGuard; atip speech corpus; channel robustness; channel-independent model; channel-specific information; cohort normalization technique; hidden Markov models; impostor verifications; performance improvement; score behaviour; score normalization methods; speaker-independent model; text-independent speaker verification; virtual cohort speaker synthesis; Adaptation models; Biological system modeling; Educational institutions; Hidden Markov models; Robustness; Speech; Speech processing; cohort-based; speaker verification; text-independent;
fLanguage
English
Publisher
ieee
Conference_Titel
Biometrics Special Interest Group (BIOSIG), 2012 BIOSIG - Proceedings of the International Conference of the
Conference_Location
Darmstadt
ISSN
1617-5468
Print_ISBN
978-1-4673-1010-9
Type
conf
Filename
6313558
Link To Document