Title :
An Alphanet approach to optimising input transformations for continuous speech recognition
Author :
Bridle, J.S. ; Dodd, L.
Author_Institution :
R. Signals & Radar Establ., Malvern, UK
Abstract :
The authors extend to continuous speech recognition (CSR) the Alphanet approach to integrating backprop networks and HMM (hidden Markov model)-based isolated word recognition. They present the theory of a method for discriminative training of components of a CSR system, using training data in the form of complete sentences. The derivatives of the discriminative score with respect to the parameters are expressed in terms of the posterior probabilities of state occupancies (gammas) under two conditions called `clamped´ and `free´ because they correspond to the two conditions in Boltzmann machine training. The authors compute these clamped and free gammas using the forward-backward algorithm twice, and use the differences to drive the adaptation of a preprocessing data transformation, which can be thought of as replacing the linear transformation which yields MFCCs, or which normalizes a grand covariance matrix
Keywords :
Markov processes; neural nets; speech recognition; Alphanet approach; Boltzmann machine training; HMM; backprop networks; clamped gammas; complete sentences; continuous speech recognition; discriminative training; forward-backward algorithm; free gammas; hidden Markov model; input transformations; isolated word recognition; posterior probabilities of state occupancies; preprocessing data transformation; Covariance matrix; Distributed computing; Drives; Frequency estimation; Hidden Markov models; Length measurement; Linear discriminant analysis; Radar; Speech recognition; Training data;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on
Conference_Location :
Toronto, Ont.
Print_ISBN :
0-7803-0003-3
DOI :
10.1109/ICASSP.1991.150331