Title :
A probabilistic acoustic map based discriminative HMM training
Author :
Huang, Eng-Fong ; Soong, Frank
Author_Institution :
Speech Res. Dept., AT&T Bell Lab., Murray Hill, NJ, USA
Abstract :
A hidden Markov model (HMM) training procedure is proposed for improving the discriminative power of a maximum-likelihood (ML)-based HMM. The discriminative HMM consists of two component models, a master model and a slave model. The master model is the conventional ML model. The slave model is obtained by aligning training tokens of a word with all but the correct word master models. All models are trained by projecting acoustic observations onto a set of common probabilistic basis functions, which is called a probabilistic acoustic map, and the output probability of a model is represented as a weighted sum of the basis functions. The proposed algorithm was tested on a 39-word, alpha-digit database of 100 speakers (50 male and 50 female). Specifically, the highly confusable E-set words were separately tested. Experimental results indicate that the new training procedure improved the recognition accuracy of the E-set words by 4.3% and of all 39 words by 3.6%
Keywords :
Markov processes; learning systems; probability; speech recognition; discriminative HMM; hidden Markov model; learning systems; probabilistic acoustic map; speech recognition; training procedure; training tokens; Acoustic testing; Cepstral analysis; Computational complexity; Covariance matrix; Databases; Hafnium; Hidden Markov models; Histograms; Lifting equipment; Loudspeakers; Master-slave; Maximum likelihood estimation; State estimation; Training data;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1990. ICASSP-90., 1990 International Conference on
Conference_Location :
Albuquerque, NM
DOI :
10.1109/ICASSP.1990.115856