• DocumentCode
    2703040
  • Title

    Gammatone Features and Feature Combination for Large Vocabulary Speech Recognition

  • Author

    Schluter, Ralf ; Bezrukov, L. ; Wagner, Hannes ; Ney, Hermann

  • Author_Institution
    Dept. of Comput. Sci., RWTH Aachen Univ., Germany
  • Volume
    4
  • fYear
    2007
  • fDate
    15-20 April 2007
  • Abstract
    In this work, an acoustic feature set based on a gammatone filterbank is introduced for large vocabulary speech recognition. The gammatone features presented here lead to competitive results on the EPPS English task, and considerable improvements were obtained by subsequent combination to a number of standard acoustic features, i.e. MFCC, PLP, MF-PLP, and VTLN plus voicedness. Best results were obtained when combining gammatone features to all other features using weighted ROVER, resulting in a relative improvement of about 12% in word error rate compared to the best single feature system. We also found that ROVER gives better results for feature combination than both log-linear model combination and LDA.
  • Keywords
    channel bank filters; feature extraction; speech processing; speech recognition; EPPS English; acoustic feature; feature combination; gammatone features; gammatone filterbank; large vocabulary speech recognition; word error rate; Biology; Cepstral analysis; Computer science; Feature extraction; Filter bank; Frequency; Humans; IIR filters; Speech recognition; Vocabulary; acoustic feature combination; auditory systems; feature extraction; gammatone filterbank; speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
  • Conference_Location
    Honolulu, HI
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0727-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2007.366996
  • Filename
    4218184