Title :
Feature normalization for speaker verification in room reverberation
Author :
Ganapathy, Sriram ; Pelecanos, Jason ; Omar, Mohamed Kamal
Author_Institution :
Dept. of ECE, Johns Hopkins Univ., Baltimore, MD, USA
Abstract :
The performance of a typical speaker verification system degrades significantly in reverberant environments. This degradation is partly due to the conventional feature extraction/compensation techniques that use analysis windows which are much shorter than typical room impulse responses. In this paper, we present a feature extraction technique which estimates long-term envelopes of speech in narrow sub-bands using frequency domain linear prediction (FDLP). When speech is corrupted by reverberation, the long-term sub-band envelopes are convolved in time with those of the room impulse response function. In a first order approximation, gain normalization of these envelopes in the FDLP model suppresses the room reverberation artifacts. Experiments are performed on the 8 core conditions of the NIST 2008 speaker recognition evaluation (SRE). In these experiments, the FDLP features provide significant improvements on the interview microphone conditions (relative improvements of 20 30%) over the corresponding baseline system with MFCC features.
Keywords :
feature extraction; frequency-domain analysis; speaker recognition; FDLP model; MFCC; SRE; feature extraction-compensation techniques; feature normalization; frequency domain linear prediction; room reverberation; speaker recognition evaluation; speaker verification; Discrete cosine transforms; Feature extraction; Mel frequency cepstral coefficient; Reverberation; Speech; Training; Frequency Domain Linear Prediction (FDLP); Room Reverberation; Speaker Verification;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2011.5947438