Title :
Maximum likelihood spectral estimation and its application to narrow-band speech coding
Author :
McAulay, Robert J.
Author_Institution :
Massachusetts Institute of Technology, Lexington, MA
fDate :
4/1/1984 12:00:00 AM
Abstract :
Itakura and Saito [1] used the maximum likelihood (ML) method to derive a spectral matching criterion for autoregressive (i.e., all-pole) random processes. In this paper, their results are generalized to periodic processes having arbitrary model spectra. For the all-pole model, Kay´s [2] covariance domain solution to the recursive ML (RML) problem is cast into the spectral domain and used to obtain the RML solution for periodic processes. When applied to speech, this leads to a method for solving the joint pitch and spectrum envelope estimation problems. It is shown that if the number of frequency power measurements greatly exceeds the model order, then the RML algorithm reduces to a pitch-directed, frequency domain version of linear predictive (LP) spectral analysis. Experiments on a real-time vocoder reveals that the RML synthetic speech has the quality of being heavily smoothed.
Keywords :
Frequency; Linear predictive coding; Maximum likelihood estimation; Narrowband; Power measurement; Random processes; Spectral analysis; Speech analysis; Speech coding; Speech processing;
Journal_Title :
Acoustics, Speech and Signal Processing, IEEE Transactions on
DOI :
10.1109/TASSP.1984.1164318