Title :
Spectral normalization employing hidden Markov modeling of line spectrum pair frequencies
Author :
Pellom, Bryan L. ; Hansen, John H L
Author_Institution :
Robust Speech Process. Lab., Duke Univ., Durham, NC, USA
Abstract :
This paper proposes a spectral normalization approach in which the acoustical qualities of an input speech waveform are mapped onto that of a desired neutral voice. Such a method can be effective in reducing the impact of speaker variability such as accent, stress, and emotion for speech recognition. In the proposed method, the transformation is performed by modeling the temporal characteristics of the line spectrum pair (LSP) frequencies of the neutral voice using hidden Markov models. The overall approach is integrated into a pitch synchronous overlap and add (PSOLA) analysis/synthesis framework. The algorithm is objectively evaluated using a distance measure based on the log-likelihood of observing the input (or normalized input) speech given Gaussian mixture speaker models for both the input and desired neutral voice. Results using the Gaussian mixture model formulated criteria demonstrate consistent normalization using a 10 speaker database
Keywords :
Gaussian processes; acoustic signal processing; hidden Markov models; spectral analysis; speech processing; speech recognition; speech synthesis; Gaussian mixture speaker models; PSOLA analysis/synthesis; accent; acoustical qualities; distance measure; emotion; hidden Markov modeling; input speech; input speech waveform; line spectrum pair frequencies; log-likelihood; neutral voice; pitch synchronous overlap and add framework; speaker database; speaker variability; spectral normalization; speech recognition; stress; temporal characteristics modeling; Frequency; Hidden Markov models; Laboratories; Loudspeakers; Robustness; Speech analysis; Speech processing; Speech recognition; Speech synthesis; Stress;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
Conference_Location :
Munich
Print_ISBN :
0-8186-7919-0
DOI :
10.1109/ICASSP.1997.596092