مرکز منطقه ای اطلاع رساني علوم و فناوري - Robust speaker recognition against background noise in an enhanced multi-condition domain

DocumentCode :

1357414

Title :

Robust speaker recognition against background noise in an enhanced multi-condition domain

Author :

Kim, Kichul ; Kim, Moo Young

Author_Institution :

Dept. of Inf. & Commun. Eng., Sejong Univ., Seoul, South Korea

Volume :

Issue :

fYear :

2010

Firstpage :

1684

Lastpage :

1688

Abstract :

In the midst of background noise environments, the performance of speaker recognition (SR) systems is considerably degraded. To estimate the model mismatch between training and evaluation data, we also propose an intra Kullback-Leibler distance (intra-KLD) measure. Based on the intra-KLD, the performance of SR systems using speech enhancement (SE) and multi-condition (MC) training can be predicted with reduced computational complexity. Since SE cannot fully remove real-world noise without modifying the clean speech signal, the SR model trained only with a clean speech signal cannot fully represent the evaluation data that include various noisy signals preprocessed by SE. To compensate for this problem, we apply SE as a preprocessing block not only for the evaluation stage, but for the training stage. Moreover, we propose to combine SE and MC training (SE-MC) where various sets of features are extracted in an SE domain and a model for each speaker is trained based on the mixture of SE-domain features. Under various background noise environments, SE, MC, and SE-MC produced SR error rates of 43.51%, 25.00%, and 20.29%, respectively.

Keywords :

computational complexity; feature extraction; speaker recognition; speech enhancement; background noise environments; clean speech signal; computational complexity; enhanced multicondition domain; feature extraction; intra Kullback-Leibler distance; intra-KLD measure; multicondition training; noisy signals; speaker recognition; speech enhancement; Error analysis; Noise; Noise measurement; Speech; Speech enhancement; Strontium; Training; Speaker recognition, speech enhancement, multi-condition training, noise estimation;

fLanguage :

English

Journal_Title :

Consumer Electronics, IEEE Transactions on

Publisher :

ieee

ISSN :

0098-3063

Type :

jour

DOI :

10.1109/TCE.2010.5606313

Filename :

5606313

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1357414