DocumentCode :
961651
Title :
Speech modelling using cepstral-time feature matrices in hidden Markov models
Author :
Vaseghi, S.V. ; Conner, P.N. ; Milner, B.P.
Author_Institution :
Sch. of Inf. Syst., East Anglia Univ., Norwich, UK
Volume :
140
Issue :
5
fYear :
1993
Firstpage :
317
Lastpage :
320
Abstract :
The paper explores the use of 2-dimensional cepstral-time features for the utilisation of correlation among successive speech spectral vectors, within a hidden-Markov-model (HMM) framework. A cepstral-time-feature matrix is obtained from a 2-dimensional discrete cosine transform of a spectral-time matrix. Advantages of cepstral-time features are that cepstral-time-feature matrices are a simple and robust method of representing short-time variation of speech spectral parameters; a cepstral-time matrix contains information on the transitional dynamics of feature vectors within the matrix; speech recognition based on cepstral time matrices is more robust in noisy environments; and use of a matrix of M cepstral vectors implies a minimum HMM-state duration constraint of M vector units. A simple framework investigated in the paper for applications of cepstral-time features is a finite-state-matrix quantiser (FSMQ), a special case of the HMM. It is used for initialisation of the training phase of HMMs.<>
Keywords :
discrete cosine transforms; hidden Markov models; matrix algebra; speech recognition; 2-dimensional discrete cosine transform; HMM; cepstral-time feature matrices; correlation; feature vectors; finite-state-matrix quantiser; noisy environments; speech modelling; speech recognition; speech spectral parameters; speech spectral vectors; training phase initialisation; transitional dynamics;
fLanguage :
English
Journal_Title :
Communications, Speech and Vision, IEE Proceedings I
Publisher :
iet
ISSN :
0956-3776
Type :
jour
Filename :
240149
Link To Document :
بازگشت