• DocumentCode
    1193418
  • Title

    Recognition of coded speech transmitted over wireless channels

  • Author

    Gómez, Angel M. ; Peinado, Antonio M. ; Sánchez, Victoria ; Rubio, Antonio J.

  • Author_Institution
    Dept. Teoria de la Senal, Granada Univ.
  • Volume
    5
  • Issue
    9
  • fYear
    2006
  • fDate
    9/1/2006 12:00:00 AM
  • Firstpage
    2555
  • Lastpage
    2562
  • Abstract
    Network-based speech recognition (NSR) and distributed speech recognition (DSR) have been proposed as solutions to translate speech recognition technologies to mobile environments. NSR is the most straightforward solution since it does not require any modification in the mobile phone, however DSR offers higher robustness against codec compression and transmission channel degradation. This paper explores an alternative approach for remote speech recognition which combines the advantages of NSR and DSR. In this scheme, a standard speech codec is used for speech transmission but the recognition is performed from the received codec parameters. In particular, we focus on the effect of transmission channel errors, which can cause a more severe performance reduction on speech recognition than codec distortion. First, we show that an NSR solution can approach DSR through a reconstruction technique along with an adapted noise reduction technique originally proposed for acoustic noise. Then, these results are improved by working with recognition features directly extracted from the codec bitstream by means of parameter transcoding. Required modifications on current networks in order to access the bitstream are described. The network upgrading with the tandem free operation (TFO) protocol is an attractive solution. This upgrade not only offers an overall improvement on the end-to-end speech quality, but would also allow a recognition performance similar, and even higher in poor channel conditions, to that obtained by DSR when parameter transcoding along with the proposed mitigation techniques are applied
  • Keywords
    mobile radio; protocols; speech codecs; speech coding; speech recognition; transcoding; voice communication; wireless channels; acoustic noise; coded speech recognition; distributed speech recognition; end-to-end speech quality; mobile environments; network-based speech recognition; parameter transcoding; speech codec; speech transmission; tandem free operation protocol; wireless channels; Acoustic distortion; Acoustic noise; Code standards; Degradation; Mobile handsets; Noise reduction; Robustness; Speech codecs; Speech recognition; Transcoding;
  • fLanguage
    English
  • Journal_Title
    Wireless Communications, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1536-1276
  • Type

    jour

  • DOI
    10.1109/TWC.2006.1687779
  • Filename
    1687779