• DocumentCode
    417304
  • Title

    Mitigation of channel errors in EFR-based speech recognition

  • Author

    Gomez, Angel M. ; Peinado, Antonio M. ; Sanchez, Victoria ; Perez-Cordoba, J.L. ; Rubio, Antonio J.

  • Author_Institution
    Dept. de Electron. y Tecnologia de Computadores, Granada Univ., Spain
  • Volume
    1
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    Network-based speech recognition (NSR) using the conventional speech channel with the enhanced full rate (EFR) or the adaptive multi-rate (AMR) codec is a very attractive approach since no change to existing mobile phones is needed. However, NSR performs worse than distributed speech recognition (DSR), where speech features are efficiently coded and transmitted on a data channel, because of both transmission channel errors and the speech encoding process. We focus on the degradation of the speech features caused by channel errors in an NSR system and propose methods to improve the quality of these features. With these methods, the performance of an NSR system based on EFR coding is comparable to that of a DSR-based system.
  • Keywords
    cellular radio; speech codecs; speech coding; speech recognition; telecommunication channels; adaptive multi-rate codec; cellular networks; data channel; distributed speech recognition; enhanced full rate codec; mobile phones; network-based speech recognition; speech encoding; transmission channel error mitigation; Codecs; Data mining; Degradation; GSM; Hardware; Hidden Markov models; Network servers; Speech processing
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2004.1326162
  • Filename
    1326162