DocumentCode
302070
Title
On the use of residual cepstrum in speech recognition
Author
He, Jialong ; Liu, Li ; Palm, Günther
Author_Institution
Ulm Univ., Germany
Volume
1
fYear
1996
fDate
7-10 May 1996
Firstpage
5
Abstract
In speech recognition based on LPC analysis the prediction residues are usually ignored, only the LPC-derived cepstral coefficients (LPCC) are used to compose feature vectors. In this study, a number of parameters (called the residual cepstrum or RCEP) were calculated from these residues and their effectiveness for speech recognition was evaluated. It was shown that the RCEP do contain useful information, in particular, they are complementary to the LPCC. In an evaluation experiment, if the LPCC were used jointly with a few RCEP coefficients, the recognition rate of the English E-set letters was improved from 54% to 67% and from 69% to 71% by the use of HMMs based recognizer and the DTW based recognizer, respectively. In addition, Mel-scaled FFT based cepstrum (MFCC) was found to be superior to LPCC
Keywords
cepstral analysis; fast Fourier transforms; hidden Markov models; linear predictive coding; natural languages; speech coding; speech processing; speech recognition; DTW based recognizer; English E-set letters; HMM based recognizer; LPC analysis; LPC derived cepstral coefficients; MFCC; Mel-scaled FFT based cepstrum; RCEP coefficients; evaluation experiment; feature vector; prediction residues; recognition rate; residual cepstrum; Cepstral analysis; Cepstrum; Helium; Hidden Markov models; Linear predictive coding; Mel frequency cepstral coefficient; Parameter estimation; Poles and zeros; Speech analysis; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
Conference_Location
Atlanta, GA
ISSN
1520-6149
Print_ISBN
0-7803-3192-3
Type
conf
DOI
10.1109/ICASSP.1996.540276
Filename
540276
Link To Document