DocumentCode
323834
Title
Parametric subspace modelling of speech transitions
Author
Reinhard, Klaus ; Niranjan, Mahesan
Author_Institution
Dept. of Eng., Cambridge Univ., UK
Volume
2
fYear
1998
fDate
12-15 May 1998
Firstpage
1105
Abstract
We report on attempting to capture segmental transition information for speech recognition tasks. The slowly varying dynamics of spectral trajectories carries much discriminant information that is very crudely modelled by traditional approaches such as HMMs. In attempts such as recurrent neural networks there is the hope, but not convincing demonstration, that such transitional information could be captured. We start from the very different position of explicitly capturing the trajectory of short time spectral parameter vectors on a subspace in which the temporal sequence information is preserved (time constrained principal component analysis). On this subspace, we attempt a parametric modelling of the trajectory, and compute a distance metric to perform classification of diphones. Much of the discriminant information is still retained in this subspace. This is illustrated on the isolated transitions /bee/,/dee/ and /gee/
Keywords
parameter estimation; pattern classification; recurrent neural nets; spectral analysis; speech processing; HMM; diphones classification; discriminant information; distance metric; isolated speech transitions; parametric subspace modelling; recurrent neural networks; segmental transition information; short time spectral parameter vectors; slowly varying dynamics; spectral trajectories; speech recognition; temporal sequence information; time constrained principal component analysis; Context modeling; Hidden Markov models; Information resources; Parameter estimation; Parametric statistics; Principal component analysis; Recurrent neural networks; Speech recognition; Subspace constraints; Time factors;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location
Seattle, WA
ISSN
1520-6149
Print_ISBN
0-7803-4428-6
Type
conf
DOI
10.1109/ICASSP.1998.675462
Filename
675462
Link To Document