DocumentCode
337528
Title
Diphone multi-trajectory subspace models
Author
Reinhard, Klaus ; Niranjan, Mahesan
Author_Institution
Dept. of Eng., Cambridge Univ., UK
Volume
2
fYear
1999
fDate
15-19 Mar 1999
Firstpage
1001
Abstract
We report on the extension of capturing speech transitions embedded in diphones using trajectory models. The slowly varying dynamics of spectral trajectories carry much discriminant information that is very crudely modelled by traditional approaches such as HMMs. We improved our methodology of explicitly capturing the trajectory of short time spectral parameter vectors introducing multi-trajectory concepts in a probabilistic framework. Optimal subspace selection is presented which finds the most discriminant plane for classification. Using the E-set from the TIMIT database results suggest that discriminant information is preserved in the subspace
Keywords
parameter space methods; probability; signal classification; spectral analysis; speech processing; E-set; HMM; TIMIT database results; classification; diphone multi-trajectory subspace models; discriminant information; optimal subspace selection; probabilistic framework; short time spectral parameter vectors; slowly varying dynamics; spectral trajectories; speech signals; speech transitions; Context modeling; Databases; Explosions; Hidden Markov models; Information resources; Parameter estimation; Principal component analysis; Recurrent neural networks; Robustness; Speech;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
Conference_Location
Phoenix, AZ
ISSN
1520-6149
Print_ISBN
0-7803-5041-3
Type
conf
DOI
10.1109/ICASSP.1999.759870
Filename
759870
Link To Document