• DocumentCode
    3246187
  • Title

    Review of AMR speech codec-and distributed speech recognition-based speech-enabled services

  • Author

    Kiss, Imre ; Lakaniem, Ari ; Yang, Cao ; Viikki, ONi

  • Author_Institution
    Audio-Visual Syst. Lab., Nokia Res. Center, Tampere, Finland
  • fYear
    2003
  • fDate
    30 Nov.-3 Dec. 2003
  • Firstpage
    613
  • Lastpage
    618
  • Abstract
    In this paper, we investigate the usefulness of general-purpose speech codecs and dedicated speech recognition codecs for speech-enabled services. Specifically, we focus on 3rd generation WCDMA systems using the adaptive multi-rate (AMR) speech codec, in comparison with the distributed speech recognition (DSR) framework. Speech recognition experiments are carried out with the AMR speech codec in a simulated packet-switched network. The performance of the DSR codec is assumed to be unaffected by transmission errors. Experimental results in British English and Mandarin Chinese indicate that no significant performance difference can be observed between the AMRand DSR-based recognition systems. The gain from using the dedicated DSR codec is unlikely to provide a perceptible improvement in terms of quality of service for the end-users. In the light of the experimental results achieved, and other implementation and economical issues, it is concluded that the use of dedicated speech recognition codecs, such as DSR, does not offer tangible benefits in real-world systems and services.
  • Keywords
    3G mobile communication; Internet telephony; code division multiple access; packet switching; quality of service; speech codecs; speech recognition; 3G WCDMA systems; AMR speech codec; DSR codec; VoIP transmission; adaptive multi-rate speech codec; distributed speech recognition framework; distributed speech-enabled services; packet-switched network; quality of service; speech recognition codecs; Adaptive systems; Audio-visual systems; Automatic speech recognition; Laboratories; Multiaccess communication; Partial response channels; Speech analysis; Speech codecs; Speech recognition; Target recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition and Understanding, 2003. ASRU '03. 2003 IEEE Workshop on
  • Print_ISBN
    0-7803-7980-2
  • Type

    conf

  • DOI
    10.1109/ASRU.2003.1318510
  • Filename
    1318510