DocumentCode :
2955576
Title :
Minimum Phoneme Error based Filter Bank Analysis for Speech Recognition
Author :
Huang, Hao ; Zhu, Jie
Author_Institution :
Dept. of Electron. Eng., Shanghai Jiao Tong Univ.
fYear :
2006
fDate :
9-12 July 2006
Firstpage :
1081
Lastpage :
1084
Abstract :
This paper investigates an optimal filter-bank design method based on the minimum phone error (MPE) criterion. A Gaussian-type filter bank is used for optimization, and the filter parameters (gain, bandwidth, and center frequency) are trained to maximize the MPE objective function and thereby reduce word error. Preliminary experimental results on a large-vocabulary continuous Mandarin speech recognition task show that, compared with both untrained Gaussian-type filters and the traditional triangular filter bank, cepstral coefficients derived from the optimized filter bank yield superior word accuracy. The filters obtained under the MPE criterion are also illustrated.
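The abstract does not give the filter formula, so the following is only a minimal sketch of how a Gaussian-type filter bank with per-filter gain, bandwidth, and center frequency could feed a cepstral front end. The function names (`gaussian_filter_bank`, `cepstra`), the exact Gaussian parameterization, and the unnormalized DCT-II are assumptions made for illustration, not the authors' implementation. The MPE training of the parameters is not shown.

```python
import numpy as np

def gaussian_filter_bank(freqs, centers, bandwidths, gains):
    """Build a (num_filters, num_bins) matrix of Gaussian-shaped weights.

    Assumed form: gain * exp(-0.5 * ((f - center) / bandwidth)^2).
    These three arrays are the parameters a discriminative criterion
    such as MPE would adjust; here they are simply given.
    """
    f = np.asarray(freqs, dtype=float)[np.newaxis, :]
    c = np.asarray(centers, dtype=float)[:, np.newaxis]
    b = np.asarray(bandwidths, dtype=float)[:, np.newaxis]
    g = np.asarray(gains, dtype=float)[:, np.newaxis]
    return g * np.exp(-0.5 * ((f - c) / b) ** 2)

def cepstra(power_spectrum, bank, num_ceps):
    """Log filter-bank energies followed by an unnormalized DCT-II."""
    energies = bank @ np.asarray(power_spectrum, dtype=float)
    log_energies = np.log(np.maximum(energies, 1e-10))  # floor avoids log(0)
    num_filters = bank.shape[0]
    n = np.arange(num_filters)
    k = np.arange(num_ceps)[:, np.newaxis]
    dct_matrix = np.cos(np.pi * k * (n + 0.5) / num_filters)
    return dct_matrix @ log_energies
```

In a setup like the one the abstract describes, `centers`, `bandwidths`, and `gains` would start from some initial layout and then be updated by gradient-based training on the MPE objective. The sketch above covers only the forward computation from a power spectrum to cepstral coefficients.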
Keywords :
Gaussian processes; cepstral analysis; channel bank filters; natural languages; optimisation; speech recognition; vocabulary; MPE-based Gaussian type filter bank; Mandarin speech recognition task; cepstral coefficient; minimum phoneme error criteria; optimization; vocabulary system; Bandwidth; Cepstral analysis; Channel bank filters; Discrete cosine transforms; Error analysis; Filter bank; Mel frequency cepstral coefficient; Optimization methods; Speech analysis; Speech recognition;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Multimedia and Expo, 2006 IEEE International Conference on
Conference_Location :
Toronto, Ont.
Print_ISBN :
1-4244-0366-7
Electronic_ISBN :
1-4244-0367-7
Type :
conf
DOI :
10.1109/ICME.2006.262722
Filename :
4036791