DocumentCode :
1544791
Title :
Use of spectral autocorrelation in spectral envelope linear prediction for speech recognition
Author :
Kim, Hong Kook ; Lee, Hwang Soo
Author_Institution :
Dept. of Electr. Eng., Korea Adv. Inst. of Sci. & Technol., Seoul, South Korea
Volume :
7
Issue :
5
fYear :
1999
fDate :
9/1/1999 12:00:00 AM
Firstpage :
533
Lastpage :
541
Abstract :
This paper proposes a linear predictive (LP) analysis method where sample autocorrelations are estimated from the spectral envelope of a speech signal on the basis of the spectral autocorrelation. The spectral autocorrelation is defined as discrete quantities of speech spectrum with spectral resolution identical to the discrete Fourier transform (DFT) used to obtain the speech spectrum. From analytical and empirical derivation of its properties, we can estimate the fundamental frequency and the maximally correlated frequency for voiced and unvoiced speech, respectively, and then obtain the spectral envelope by sampling at a rate of the estimated frequency. A frequency normalization can be applied to the estimated spectral envelope because the number of samples of the spectral envelope usually differs from frame to frame. The spectral envelope is warped into the mel-frequency scale and the inverse DFT is applied to extract the estimate of sample autocorrelations. From the result of LP analysis on the sample autocorrelations, we finally obtain the spectral envelope cepstral coefficients (SECC). Hidden Markov model (HMM) recognition experiments show that SECC significantly improves the performance of a recognizer at low signal-to-noise ratios (SNRs) over several other representations
Keywords :
cepstral analysis; correlation methods; discrete Fourier transforms; frequency estimation; hidden Markov models; prediction theory; speech recognition; HMM recognition experiments; Hidden Markov model; discrete Fourier transform; frequency normalization; fundamental frequency; inverse DFT; low signal-to-noise ratios; maximally correlated frequency; mel-frequency scale; sampling; spectral autocorrelation; spectral envelope cepstral coefficients; spectral envelope linear prediction; spectral resolution; speech recognition; speech signal; unvoiced speech; voiced speech; Autocorrelation; Cepstral analysis; Discrete Fourier transforms; Frequency estimation; Hidden Markov models; Sampling methods; Signal analysis; Signal resolution; Signal to noise ratio; Speech analysis;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/89.784105
Filename :
784105
Link To Document :
بازگشت