مرکز منطقه ای اطلاع رساني علوم و فناوري - DNN feature compensation for noise robust speaker verification

DocumentCode :

735090

Title :

DNN feature compensation for noise robust speaker verification

Author :

Du, Steven ; Xiong Xiao ; Eng Siong Chng

Author_Institution :

Sch. of Comput. Eng., Nanyang Technol. Univ. (NTU), Singapore, Singapore

fYear :

2015

fDate :

12-15 July 2015

Firstpage :

871

Lastpage :

875

Abstract :

The speaker verification (SV) task has been an active area of research in the last thirty years. One of the recent research topics is on improving the robustness of SV system in challenging environments. This paper examines the robustness of current state of the art SV system against background noise corruptions. Specifically, we consider the scenario where the SV system is trained from noise free speech and tested on background noise corrupted speech. To improve robustness of the system, a deep neural networks (DNN) based feature compensation is proposed to enhance the cepstral features before the evaluation. The DNN is trained from parallel data of clean and noise corrupted speech which are aligned in the frame level. The training is achieved by minimizing the mean square error (MSE) between the DNN´s prediction and the target clean features. The trained network could predict the underlying clean features when given noisy features. Results on the benchmarking SRE 2010 female core task show that by using DNN based feature compensation, the equal error rate (EER) can be reduced in most of the times even when the test noise is unseen during DNN training. The relative EER reduction usually is in the range of 3% to 26%.

Keywords :

cepstral analysis; error statistics; mean square error methods; neural nets; speaker recognition; speech processing; DNN based feature compensation; DNN feature compensation; EER reduction; MSE; SV system; background noise corrupted speech; background noise corruption; cepstral feature; clean corrupted speech; deep neural network; equal error rate; mean square error; noise free speech; noise robust speaker verification; parallel data; trained network; Feature extraction; Noise; Noise measurement; Robustness; Speech; Speech processing; Speech recognition; DNN; feature compensation; noise robustness; speaker verification;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Signal and Information Processing (ChinaSIP), 2015 IEEE China Summit and International Conference on

Conference_Location :

Chengdu

Type :

conf

DOI :

10.1109/ChinaSIP.2015.7230529

Filename :

7230529

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=735090