• DocumentCode
    2265747
  • Title

    Comparison of channel normalisation techniques for automatic speech recognition over the phone

  • Author

    De Veth, Johan ; Boves, Louis

  • Author_Institution
    Dept. of Language & Speech, Nijmegen Univ., Netherlands
  • Volume
    4
  • fYear
    1996
  • fDate
    3-6 Oct 1996
  • Firstpage
    2332
  • Abstract
    We compared three different channel normalisation (CN) methods in the context of a connected digit recognition task over the phone: cepstrum mean substraction (CMS), RASTA filtering and the Gaussian dynamic cepstrum representation (GDCR). Using a small set of context independent (CI) continuous Gaussian mixture hidden Markov models (HMMs), we found that CMS and RASTA outperformed the GDCR technique. We show that the main cause for the superiority of CMS compared to RASTA is the phase distortion introduced by the RASTA filter. Recognition results for a phase corrected RASTA technique are identical to those of CMS. Our results indicate that an ideal cepstrum based CN method should: (1) effectively remove the DC component; (2) at least preserve modulation frequencies in the range 2-16 Hz; and (3) introduce no phase distortion in case CI HMMs are used for recognition
  • Keywords
    filtering theory; hidden Markov models; speech processing; speech recognition; telecommunication channels; telecommunication computing; telephony; GDCR technique; Gaussian dynamic cepstrum representation; RASTA filter; RASTA filtering; automatic speech recognition; cepstrum based CN method; cepstrum mean substraction; channel normalisation methods; channel normalisation techniques; connected digit recognition task; context independent continuous Gaussian mixture hidden Markov models; modulation frequencies; phase corrected RASTA technique; phase distortion; phone; Automatic speech recognition; Cepstrum; Collision mitigation; Filtering; Hidden Markov models; Nonlinear filters; Phase distortion; Signal processing; Speech recognition; Telephony;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
  • Conference_Location
    Philadelphia, PA
  • Print_ISBN
    0-7803-3555-4
  • Type

    conf

  • DOI
    10.1109/ICSLP.1996.607275
  • Filename
    607275