Title :
Noise-Robust Speaker Recognition Combining Missing Data Techniques and Universal Background Modeling
Author :
May, Tobias ; van de Par, Steven ; Kohlrausch, Armin
Author_Institution :
Inst. of Phys., Univ. of Oldenburg, Oldenburg, Germany
Abstract :
Although the field of automatic speaker recognition (ASR) has been the subject of extensive research over the past decades, the lack of robustness against background noise has remained a major challenge. This paper describes a noise-robust speaker recognition system that combines missing data (MD) recognition with the adaptation of speaker models using a universal background model (UBM). For MD recognition, the identification of reliable and unreliable feature components is required. For this purpose, the signal-to-noise ratio (SNR) based mask estimation performance of various state-of-the art noise estimation techniques and noise reduction schemes is compared. Speaker recognition experiments show that the usage of a UBM in combination with missing data recognition yields substantial improvements in recognition performance, especially in the presence of highly non-stationary background noise at low SNRs.
Keywords :
speaker recognition; automatic speaker recognition; mask estimation; missing data recognition; missing data technique; noise-robust speaker recognition; nonstationary background noise; signal-to-noise ratio; speaker model; universal background modeling; Adaptation model; Data models; Estimation; Materials; Speaker recognition; Speech; Speech recognition; Automatic speaker recognition (ASR); mask estimation; mel frequency cepstral coefficient (MFCC); missing data; noise robustness; universal background model (UBM);
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Conference_Location :
5/31/2011 12:00:00 AM
DOI :
10.1109/TASL.2011.2158309