Title :
Weighted training for speech under Lombard Effect for speaker recognition
Author :
Saleem, Muhammad Muneeb ; Gang Liu ; Hansen, John H. L.
Author_Institution :
Center for Robust Speech Syst. (CRSS), Univ. of Texas at Dallas, Richardson, TX, USA
Abstract :
The presence of Lombard Effect in speech is proven to have severe effects on the performance of speech systems, especially speaker recognition. Varying kinds of Lombard speech are produced by speakers under influence of varying noise types [1]. This study proposes a high-accuracy classifier using deep neural networks for detecting various kinds of Lombard speech against neutral speech, independent of the noise levels causing the Lombard Effect. Lombard Effect detection accuracies as high as 95.7% are achieved using this novel model. The deep neural network based classification is further exploited by validation based weighted training of robust i-Vector based speaker identification systems. The proposed weighted training achieves a relative EER improvement of 28.4% over an i-Vector baseline system, confirming the effectiveness of deep neural networks in modeling Lombard Effect.
Keywords :
acoustic noise; neural nets; speaker recognition; speech intelligibility; Lombard effect; Lombard speech; deep neural networks; neutral speech; noise levels; relative EER improvement; robust i-Vector; speaker identification; speaker recognition; speech; Accuracy; Neural networks; Noise; Speech; Speech recognition; Training; Training data; Lombard Effect; deep neural networks; robust; speaker identification; weighted training;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
DOI :
10.1109/ICASSP.2015.7178792