• DocumentCode
    2800045
  • Title

    Boosted binary features for noise-robust speaker verification

  • Author

    Roy, Anindya ; Magimai-Doss, Mathew ; Marcel, Sébastien

  • fYear
    2010
  • fDate
    14-19 March 2010
  • Firstpage
    4442
  • Lastpage
    4445
  • Abstract
    The standard approach to speaker verification is to extract cepstral features from the speech spectrum and model them by generative or discriminative techniques. We propose a novel approach where a set of client-specific binary features carrying maximal discriminative information specific to the individual client are estimated from an ensemble of pair-wise comparisons of frequency components in magnitude spectra, using Adaboost algorithm. The final classifier is a simple linear combination of these selected features. Experiments on the XM2VTS database strictly according to a standard evaluation protocol have shown that although the proposed framework yields comparatively lower performance on clean speech, it significantly outperforms the state-of-the-art MFCC-GMM system in mismatched conditions with training on clean speech and testing on speech corrupted by four types of additive noise from the standard Noisex-92 database.
  • Keywords
    feature extraction; pattern classification; signal denoising; speaker recognition; speech processing; Adaboost algorithm; Noisex-92 database; XM2VTS database; additive noise; boosted binary features; cepstral feature extraction; client-specific binary features; discriminative techniques; final classifier; frequency components; generative techniques; linear combination; magnitude spectra; maximal discriminative information; noise-robust speaker verification; pair-wise comparisons; speech corruption; speech spectrum; standard evaluation protocol; Additive noise; Cepstral analysis; Data mining; Feature extraction; Frequency estimation; Noise robustness; Protocols; Spatial databases; Speech analysis; Speech enhancement; Adaboost; Speaker verification; binary features; noise robustness; speaker-specific features;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-4295-9
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2010.5495622
  • Filename
    5495622