Title :
Recent improvements in the CU Sonic ASR system for noisy speech: the SPINE task
Author :
Pellom, Bryan ; Hacioglu, Kadri
Author_Institution :
Univ. of Colorado at Boulder, CO, USA
Abstract :
We report on recent improvements in the University of Colorado system for the DARPA/NRL Speech in Noisy Environments (SPINE) task. In particular, we describe our efforts on improving acoustic and language modeling for the task and investigate methods for unsupervised speaker and environment adaptation from limited data. We show that the MAPLR adaptation method outperforms single and multiple regression class MLLR on the SPINE task. Our current SPINE system uses the Sonic speech recognition engine that was developed at the University of Colorado. This system is shown to have a word error rate of 31.5% on the SPINE-2 evaluation data. These improvements amount to a 16% reduction in relative word error rate compared to our previous SPINE-2 system fielded in the November 2001 DARPA/NRL evaluation.
Keywords :
acoustic signal processing; adaptive signal processing; iterative methods; natural languages; noise; speech recognition; unsupervised learning; CU Sonic ASR system; DARPA/NRL Speech in Noisy Environments; MAPLR adaptation method; SPINE task; SPINE-2 evaluation data; Sonic speech recognition engine; University of Colorado; acoustic modeling; language modeling; multiple regression class MLLR; noisy speech; unsupervised environment adaptation; unsupervised speaker adaptation; word error analysis; word error rate reduction; Acoustic noise; Automatic speech recognition; Colored noise; Engines; Error analysis; Loudspeakers; Maximum likelihood linear regression; Natural languages; Speech recognition; Working environment noise;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
Print_ISBN :
0-7803-7663-3
DOI :
10.1109/ICASSP.2003.1198702