Weighted training for speech under Lombard Effect for speaker recognition

Author

Saleem, Muhammad Muneeb ; Gang Liu ; Hansen, John H. L.

Author_Institution

Center for Robust Speech Syst. (CRSS), Univ. of Texas at Dallas, Richardson, TX, USA

fYear

2015

fDate

19-24 April 2015

Firstpage

4350

Lastpage

4354

Abstract

The presence of Lombard Effect in speech is proven to have severe effects on the performance of speech systems, especially speaker recognition. Varying kinds of Lombard speech are produced by speakers under influence of varying noise types [1]. This study proposes a high-accuracy classifier using deep neural networks for detecting various kinds of Lombard speech against neutral speech, independent of the noise levels causing the Lombard Effect. Lombard Effect detection accuracies as high as 95.7% are achieved using this novel model. The deep neural network based classification is further exploited by validation based weighted training of robust i-Vector based speaker identification systems. The proposed weighted training achieves a relative EER improvement of 28.4% over an i-Vector baseline system, confirming the effectiveness of deep neural networks in modeling Lombard Effect.

Keywords

acoustic noise; neural nets; speaker recognition; speech intelligibility; Lombard effect; Lombard speech; deep neural networks; neutral speech; noise levels; relative EER improvement; robust i-Vector; speaker identification; speaker recognition; speech; Accuracy; Neural networks; Noise; Speech; Speech recognition; Training; Training data; Lombard Effect; deep neural networks; robust; speaker identification; weighted training;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on

Conference_Location

South Brisbane, QLD

Type

conf

DOI

10.1109/ICASSP.2015.7178792

Filename

7178792