Title :
Smoothness analysis for trajectory features
Author :
Hu, Zhihong ; Barnard, Etienne
Author_Institution :
Center for Spoken Language Understanding, Oregon Graduate Inst. of Sci. & Technol., Portland, OR, USA
Abstract :
Dynamic modeling of speech is potentially a major improvement on hidden Markov models (HMMs). In one approach, trajectory models are used to model the dynamics of the spectrum, and are used as basis for classification. Although some improvement has been achieved in this way, one would hope for more substantial improvements given that the independence assumption is removed. One reason why this was not achieved may be that the trajectory models are based on cepstral coefficients; we show that these tracks contain spurious oscillations. This suggests that these trajectory features might have a high within-class variance. We introduce a measure of evaluating the smoothness of trajectory-based features. This measure provides a method of selecting the best of a set of similar features. Formant trajectories prove to be significantly smoother than trajectories of mel scale cepstral coefficients (MFCC) by this measure, but this does not translate directly to improved performance
Keywords :
cepstral analysis; feature extraction; hidden Markov models; smoothing methods; speech processing; speech recognition; HMM; cepstral coefficients; classification; dynamic modeling; formant trajectories; hidden Markov models; mel scale cepstral coefficients; smoothness analysis; speech modeling; speech recognition; spurious oscillations; trajectory features; trajectory models; Acoustics; Cepstral analysis; Hidden Markov models; Loudspeakers; Mel frequency cepstral coefficient; Natural languages; Polynomials; Signal analysis; Speech recognition;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
Conference_Location :
Munich
Print_ISBN :
0-8186-7919-0
DOI :
10.1109/ICASSP.1997.596103