DocumentCode
2800045
Title
Boosted binary features for noise-robust speaker verification
Author
Roy, Anindya ; Magimai-Doss, Mathew ; Marcel, Sébastien
fYear
2010
fDate
14-19 March 2010
Firstpage
4442
Lastpage
4445
Abstract
The standard approach to speaker verification is to extract cepstral features from the speech spectrum and model them by generative or discriminative techniques. We propose a novel approach where a set of client-specific binary features carrying maximal discriminative information specific to the individual client are estimated from an ensemble of pair-wise comparisons of frequency components in magnitude spectra, using Adaboost algorithm. The final classifier is a simple linear combination of these selected features. Experiments on the XM2VTS database strictly according to a standard evaluation protocol have shown that although the proposed framework yields comparatively lower performance on clean speech, it significantly outperforms the state-of-the-art MFCC-GMM system in mismatched conditions with training on clean speech and testing on speech corrupted by four types of additive noise from the standard Noisex-92 database.
Keywords
feature extraction; pattern classification; signal denoising; speaker recognition; speech processing; Adaboost algorithm; Noisex-92 database; XM2VTS database; additive noise; boosted binary features; cepstral feature extraction; client-specific binary features; discriminative techniques; final classifier; frequency components; generative techniques; linear combination; magnitude spectra; maximal discriminative information; noise-robust speaker verification; pair-wise comparisons; speech corruption; speech spectrum; standard evaluation protocol; Additive noise; Cepstral analysis; Data mining; Feature extraction; Frequency estimation; Noise robustness; Protocols; Spatial databases; Speech analysis; Speech enhancement; Adaboost; Speaker verification; binary features; noise robustness; speaker-specific features;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location
Dallas, TX
ISSN
1520-6149
Print_ISBN
978-1-4244-4295-9
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2010.5495622
Filename
5495622
Link To Document