Title :
Formant analysis using mixtures of Gaussians
Author :
Zolfaghari, P. ; Robinson, Tony
Author_Institution :
Dept. of Eng., Cambridge Univ., UK
Abstract :
The paper describes a new formant analysis technique whereby the formant parameters are represented in the form of Gaussian mixture distributions. These are estimated from the discrete Fourier transform (DFT) magnitude spectrum of the speech signal. The parameters obtained are the means, variances and the masses of the density functions, which are used to calculate centre frequencies, bandwidths and amplitudes of formants within the spectrum. In order to better fit the mixture distributions various modifications to the DFT magnitude spectrum, based on simple models of perception, were investigated. These include reduction of dynamic range, cepstral smoothing, use of the Mel scale and pre-emphasis of speech. Results are presented for these as well as formant tracks from analysing speech using the final formant analysis system
Keywords :
Gaussian distribution; discrete Fourier transforms; spectral analysis; speech processing; Gaussian mixture distributions; Mel scale; amplitudes; bandwidths; centre frequencies; cepstral smoothing; density function mass; discrete Fourier transform magnitude spectrum; dynamic range reduction; formant analysis; formant parameters; formant tracks; mean; perception models; speech analysis; speech pre-emphasis; speech signal; variance; Bandwidth; Cepstral analysis; Density functional theory; Discrete Fourier transforms; Dynamic range; Frequency; Gaussian distribution; Gaussian processes; Smoothing methods; Speech analysis;
Conference_Titel :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
DOI :
10.1109/ICSLP.1996.607830