DocumentCode :
323834
Title :
Parametric subspace modelling of speech transitions
Author :
Reinhard, Klaus ; Niranjan, Mahesan
Author_Institution :
Dept. of Eng., Cambridge Univ., UK
Volume :
2
fYear :
1998
fDate :
12-15 May 1998
Firstpage :
1105
Abstract :
We report on attempting to capture segmental transition information for speech recognition tasks. The slowly varying dynamics of spectral trajectories carries much discriminant information that is very crudely modelled by traditional approaches such as HMMs. In attempts such as recurrent neural networks there is the hope, but not convincing demonstration, that such transitional information could be captured. We start from the very different position of explicitly capturing the trajectory of short time spectral parameter vectors on a subspace in which the temporal sequence information is preserved (time constrained principal component analysis). On this subspace, we attempt a parametric modelling of the trajectory, and compute a distance metric to perform classification of diphones. Much of the discriminant information is still retained in this subspace. This is illustrated on the isolated transitions /bee/,/dee/ and /gee/
Keywords :
parameter estimation; pattern classification; recurrent neural nets; spectral analysis; speech processing; HMM; diphones classification; discriminant information; distance metric; isolated speech transitions; parametric subspace modelling; recurrent neural networks; segmental transition information; short time spectral parameter vectors; slowly varying dynamics; spectral trajectories; speech recognition; temporal sequence information; time constrained principal component analysis; Context modeling; Hidden Markov models; Information resources; Parameter estimation; Parametric statistics; Principal component analysis; Recurrent neural networks; Speech recognition; Subspace constraints; Time factors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location :
Seattle, WA
ISSN :
1520-6149
Print_ISBN :
0-7803-4428-6
Type :
conf
DOI :
10.1109/ICASSP.1998.675462
Filename :
675462
Link To Document :
بازگشت