• DocumentCode
    3464845
  • Title

    On the use of filter-bank energies as features for robust speech recognition

  • Author

    Paliwal, K.K.

  • Author_Institution
    Sch. of Microelectron. Eng., Griffith Univ., Brisbane, Qld., Australia
  • Volume
    2
  • fYear
    1999
  • fDate
    1999
  • Firstpage
    641
  • Abstract
    Though mel frequency cepstral coefficients (MFCCs) have been very successful in speech recognition, they have the following two problems: (1) they do not have any physical interpretation, and (2) liftering of cepstral coefficients, found to be highly useful in the earlier dynamic warping-based speech recognition systems, has no effect in the recognition process when used with continuous observation Gaussian density hidden Markov models. We propose to use the filter-bank energies (FBEs) as features. The FBEs are physically meaningful quantities and amenable for applying human auditory processing such as masking. We describe procedures to decorrelate and lifter the FBEs and show that the FBEs perform at least as good as (and sometimes even better than) the MFCCs for robust speech recognition
  • Keywords
    Gaussian processes; cepstral analysis; channel bank filters; decorrelation; filtering theory; hearing; hidden Markov models; speech recognition; MFCC; cepstral coefficients liftering; continuous observation Gaussian density HMM; decorrelation; dynamic warping-based speech recognition systems; filter-bank energies; hidden Markov models; human auditory processing; masking; mel frequency cepstral coefficients; robust speech recognition; Australia; Automatic speech recognition; Cepstral analysis; Cepstrum; Decorrelation; Discrete cosine transforms; Hidden Markov models; Humans; Robustness; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing and Its Applications, 1999. ISSPA '99. Proceedings of the Fifth International Symposium on
  • Conference_Location
    Brisbane, Qld.
  • Print_ISBN
    1-86435-451-8
  • Type

    conf

  • DOI
    10.1109/ISSPA.1999.815754
  • Filename
    815754