DocumentCode :
1544785
Title :
Generalized mel frequency cepstral coefficients for large-vocabulary speaker-independent continuous-speech recognition
Author :
Vergin, Rivarol ; O´Shaughnessy, Douglas ; Farhat, Azarshid
Author_Institution :
INRS Telecommun., Ile des Soeurs, Que., Canada
Volume :
7
Issue :
5
fYear :
1999
fDate :
9/1/1999 12:00:00 AM
Firstpage :
525
Lastpage :
532
Abstract :
The focus of a continuous speech recognition process is to match an input signal with a set of words or sentences according to some optimality criteria. The first step of this process is parameterization, whose major task is data reduction by converting the input signal into parameters while preserving virtually all of the speech signal information dealing with the text message. This contribution presents a detailed analysis of a widely used set of parameters, the mel frequency cepstral coefficients (MFCCs), and suggests a new parameterization approach taking into account the whole energy zone in the spectrum. Results obtained with the proposed new coefficients give a confidence interval about their use in a large-vocabulary speaker-independent continuous-speech recognition system
Keywords :
cepstral analysis; parameter estimation; speech recognition; confidence interval; energy zone; generalized mel frequency cepstral coefficients; input signal; interpolation; large-vocabulary continuous-speech recognition; optimality criteria; parameters; sentences; speaker-independent continuous-speech recognition; spectrum; text message; words; Acoustic applications; Cepstral analysis; Data mining; Discrete Fourier transforms; Filters; Linear predictive coding; Mel frequency cepstral coefficient; Signal processing; Speech recognition; Speech synthesis;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/89.784104
Filename :
784104
Link To Document :
بازگشت