Title :
Mitigation of channel errors in EFR-based speech recognition
Author :
Gomez, Angel M. ; Peinado, Antonio M. ; Sanchez, Victoria ; Perez-Cordoba, J.L. ; Rubio, Antonio J.
Author_Institution :
Dept. de Electron. y Tecnologia de Computadores, Granada Univ., Spain
Abstract :
Network-based speech recognition (NSR) using the conventional speech channel with the enhanced full rate (EFR) or the adaptive multi-rate (AMR) codec is a very attractive approach because it requires no change to existing mobile phones. However, NSR performs worse than distributed speech recognition (DSR), where speech features are efficiently coded and transmitted on a data channel, because of both transmission channel errors and the speech encoding process. We focus on the degradation of the speech features caused by channel errors in an NSR system and propose methods to improve the quality of these features. With these methods applied, the performance of an NSR system based on EFR coding becomes comparable to that of a DSR system.
Keywords :
cellular radio; speech codecs; speech coding; speech recognition; telecommunication channels; adaptive multi-rate codec; cellular networks; data channel; distributed speech recognition; enhanced full rate codec; mobile phones; network-based speech recognition; speech encoding; transmission channel error mitigation; Codecs; Data mining; Degradation; GSM; Hardware; Hidden Markov models; Network servers; Speech coding; Speech processing; Speech recognition;
Conference_Title :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
Print_ISBN :
0-7803-8484-9
DOI :
10.1109/ICASSP.2004.1326162