DocumentCode
1544785
Title
Generalized mel frequency cepstral coefficients for large-vocabulary speaker-independent continuous-speech recognition
Author
Vergin, Rivarol ; O´Shaughnessy, Douglas ; Farhat, Azarshid
Author_Institution
INRS Telecommun., Ile des Soeurs, Que., Canada
Volume
7
Issue
5
fYear
1999
fDate
9/1/1999 12:00:00 AM
Firstpage
525
Lastpage
532
Abstract
The focus of a continuous speech recognition process is to match an input signal with a set of words or sentences according to some optimality criteria. The first step of this process is parameterization, whose major task is data reduction by converting the input signal into parameters while preserving virtually all of the speech signal information dealing with the text message. This contribution presents a detailed analysis of a widely used set of parameters, the mel frequency cepstral coefficients (MFCCs), and suggests a new parameterization approach taking into account the whole energy zone in the spectrum. Results obtained with the proposed new coefficients give a confidence interval about their use in a large-vocabulary speaker-independent continuous-speech recognition system
Keywords
cepstral analysis; parameter estimation; speech recognition; confidence interval; energy zone; generalized mel frequency cepstral coefficients; input signal; interpolation; large-vocabulary continuous-speech recognition; optimality criteria; parameters; sentences; speaker-independent continuous-speech recognition; spectrum; text message; words; Acoustic applications; Cepstral analysis; Data mining; Discrete Fourier transforms; Filters; Linear predictive coding; Mel frequency cepstral coefficient; Signal processing; Speech recognition; Speech synthesis;
fLanguage
English
Journal_Title
Speech and Audio Processing, IEEE Transactions on
Publisher
ieee
ISSN
1063-6676
Type
jour
DOI
10.1109/89.784104
Filename
784104
Link To Document