Title :
A time-varying analysis method for rapid transitions in speech
Author :
Nathan, Krishna S. ; Lee, Yi-Teh ; Silverman, Harvey F.
Author_Institution :
Div. of Eng., Brown Univ., Providence, RI, USA
fDate :
4/1/1991 12:00:00 AM
Abstract :
A linear predictive coding (LPC) model based on time-dependent poles which has yielded promising results when applied to synthetic data is applied to real speech data. The data are processed pitch-synchronously using a simple procedure to identify regions of the data that best fit the model. The maximum-likelihood technique, which has been found to be robust in the presence of noise, is used to estimate the parameters. Resulting formant estimates for several diphthongs are presented. The algorithm tracks the formants well, both in stable regions and in regions of transition. This ability to track formant variation within analysis intervals is a definite advantage over traditional LPC. Results from speech data involving final stop consonants are presented. Rapid changes, particularly in the first and second formants, in the region immediately prior to the stop are detected. Such abrupt transitions are often not detected by traditional time-invariant methods
Keywords :
parameter estimation; speech analysis and processing; LPC model; diphthongs; final stop consonants; formant estimates; linear predictive coding; maximum-likelihood technique; parameter estimation; pitch-synchronous processing; rapid transitions in speech; time-dependent poles; time-varying analysis method; Data mining; Difference equations; Linear predictive coding; Maximum likelihood detection; Maximum likelihood estimation; Noise robustness; Parameter estimation; Parameter extraction; Parametric statistics; Speech analysis;
Journal_Title :
Signal Processing, IEEE Transactions on