DocumentCode
417304
Title
Mitigation of channel errors in EFR-based speech recognition
Author
Gomez, Angel M. ; Peinado, Antonio M. ; Sanchez, Victoria ; Perez-Curdoba, J.L. ; Rubio, Antonio J.
Author_Institution
Dept. de Electron. y Tecnologia de Computadores, Granada Univ., Spain
Volume
1
fYear
2004
fDate
17-21 May 2004
Abstract
Network-based speech recognition (NSR) using the conventional speech channel with the enhanced full rate (EFR) or the adaptive multi-rate (AMR) codec is a very attractive approach since no change to existing mobile phones is needed. However, NSR reveals a degrading performance due to both transmission channel errors and the speech encoding process in comparison with distributed speech recognition (DSR), where speech features are efficiently coded and transmitted on a data channel. We focus on the degradation of the speech features caused by channel errors in an NSR system and propose methods to improve the quality of these features. Applying these methods, it turns out that the performance of an NSR system based on EFR coding is comparable to that based on DSR.
Keywords
cellular radio; speech codecs; speech coding; speech recognition; telecommunication channels; adaptive multi-rate codec; cellular networks; data channel; distributed speech recognition; enhanced full rate codec; mobile phones; network-based speech recognition; speech encoding; transmission channel error mitigation; Codecs; Data mining; Degradation; GSM; Hardware; Hidden Markov models; Network servers; Speech coding; Speech processing; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-8484-9
Type
conf
DOI
10.1109/ICASSP.2004.1326162
Filename
1326162
Link To Document