DocumentCode
2703040
Title
Gammatone Features and Feature Combination for Large Vocabulary Speech Recognition
Author
Schluter, Ralf ; Bezrukov, L. ; Wagner, Hannes ; Ney, Hermann
Author_Institution
Dept. of Comput. Sci., RWTH Aachen Univ., Germany
Volume
4
fYear
2007
fDate
15-20 April 2007
Abstract
In this work, an acoustic feature set based on a gammatone filterbank is introduced for large vocabulary speech recognition. The gammatone features presented here lead to competitive results on the EPPS English task, and considerable improvements were obtained by subsequent combination to a number of standard acoustic features, i.e. MFCC, PLP, MF-PLP, and VTLN plus voicedness. Best results were obtained when combining gammatone features to all other features using weighted ROVER, resulting in a relative improvement of about 12% in word error rate compared to the best single feature system. We also found that ROVER gives better results for feature combination than both log-linear model combination and LDA.
Keywords
channel bank filters; feature extraction; speech processing; speech recognition; EPPS English; acoustic feature; feature combination; gammatone features; gammatone filterbank; large vocabulary speech recognition; word error rate; Biology; Cepstral analysis; Computer science; Feature extraction; Filter bank; Frequency; Humans; IIR filters; Speech recognition; Vocabulary; acoustic feature combination; auditory systems; feature extraction; gammatone filterbank; speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location
Honolulu, HI
ISSN
1520-6149
Print_ISBN
1-4244-0727-3
Type
conf
DOI
10.1109/ICASSP.2007.366996
Filename
4218184
Link To Document