DocumentCode :
1686634
Title :
Improving speaker identification robustness to highly channel-degraded speech through multiple system fusion
Author :
McLaren, Moray ; Scheffer, Nicolas ; Graciarena, Martin ; Ferrer, Luciana ; Yun Lei
Author_Institution :
Speech Technol. & Res. Lab., SRI Int., Menlo Park, CA, USA
fYear :
2013
Firstpage :
6773
Lastpage :
6777
Abstract :
This article describes our submission to the speaker identification (SID) evaluation for the first phase of the DARPA Robust Automatic Transcription of Speech (RATS) program. The evaluation focuses on speech data heavily degraded by channel effects. We show here how we designed a robust system using multiple streams of noise-robust features that were combined at a later stage in an i-vector framework. For all channels of interest, our combination strategy presents up to a 41% relative improvement in miss rate at a 4% false alarm rate with respect to the best-performing single-stream system.
Keywords :
speaker recognition; speech processing; DARPA robust automatic transcription of speech program; RATS program; SID evaluation; best-performing single-stream system; channel effects; channel-degraded speech; i-vector framework; multiple system fusion; noise-robust features; robust system; speaker identification evaluation; speaker identification robustness; speech data; Conferences; Feature extraction; Hidden Markov models; Rats; Robustness; Speech; Speech recognition; degraded speech; i-vector; speaker verification;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
ISSN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2013.6638973
Filename :
6638973
Link To Document :
بازگشت