• DocumentCode
    3424093
  • Title

    Hierarchical spectro-temporal features for robust speech recognition

  • Author

    Domont, Xavier ; Heckmann, Martin ; Joublin, Frank ; Goerick, Christian

  • Author_Institution
    Honda Res. Inst. Eur. GmbH, Offenbach am Main
  • fYear
    2008
  • fDate
    March 31 2008-April 4 2008
  • Firstpage
    4417
  • Lastpage
    4420
  • Abstract
    Previously we presented an auditory-inspired feed-forward architecture which achieves good performance in noisy conditions on a segmented word recognition task. In this paper we propose to use a modified version of this hierarchical model to generate features for standard hidden Markov models. To obtain these features we firstly compute the spectrograms using a Gammatone filterbank. A filtering over the channels permits to enhance the formant frequencies which are afterwards detected using Gabor-like receptive fields. Then the responses of the receptive fields are combined to complex features which span the whole frequency range and extend over three different time windows. The features have been evaluated on a single digit recognition task. The results show that their combination with MFCCs or RASTA features yields improved recognition scores in noise.
  • Keywords
    hidden Markov models; speech recognition; Gabor-like receptive fields; Gammatone filterbank; auditory-inspired feedforward architecture; hidden Markov models; hierarchical spectro-temporal features; robust speech recognition; segmented word recognition task; single digit recognition task; Computer architecture; Feedforward systems; Filter bank; Filtering; Frequency; Gabor filters; Hidden Markov models; Robustness; Spectrogram; Speech recognition; Robust features; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
  • Conference_Location
    Las Vegas, NV
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-1483-3
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2008.4518635
  • Filename
    4518635