• DocumentCode
    353542
  • Title

    Comparing acoustic features for robust ASR in fixed and cellular network applications

  • Author

    de Wet, Febe ; Cranen, Bert ; De Veth, Johan ; Boves, L.

  • Author_Institution
    Dept. of Language & Speech, Nijmegen Univ., Netherlands
  • Volume
    3
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    1415
  • Abstract
    Within the context of automatic speech recognition (ASR) applications for telephony, we investigate the acoustic preprocessing issues that are at stake in going from the fixed line to the cellular network. Because the spectral representation used in enhanced full rate GSM is linear prediction, we investigate the relative advantages and drawbacks of conventional mel-frequency cepstral coefficient (MFCC) parameters derived from a non-parametric fast Fourier transform (FFT) and MFCC parameters derived from a linear predictive coding (LPC) spectral estimate. Robust formant parameters, also derived from an LPC description of the spectrum, are studied as an alternative to MFCCs. Within the framework of connected digit recognition based on hidden Markov models, ASR performance was measured for clean conditions, as well as for three different additive noise conditions. In addition, the performance of a conventional recognition procedure was compared with the performance of an ASR system based on our acoustic backing-off implementation of missing feature theory (MFT)
  • Keywords
    acoustic noise; cellular radio; cepstral analysis; fast Fourier transforms; hidden Markov models; linear predictive coding; speech coding; speech recognition; LPC spectral estimate; MFCC parameter; acoustic backing-off implementation; acoustic features; acoustic preprocessing; additive noise conditions; automatic speech recognition; cellular network applications; connected digit recognition; enhanced full rate GSM; fixed network applications; hidden Markov models; linear predictive coding spectral estimate; mel-frequency cepstral coefficient parameters; missing feature theory; nonparametric fast Fourier transform; robust ASR; robust formant parameters; spectral representation; Acoustic applications; Automatic speech recognition; Cepstral analysis; Fast Fourier transforms; GSM; Land mobile radio cellular systems; Linear predictive coding; Mel frequency cepstral coefficient; Robustness; Telephony;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
  • Conference_Location
    Istanbul
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-6293-4
  • Type

    conf

  • DOI
    10.1109/ICASSP.2000.861856
  • Filename
    861856