DocumentCode :
1742224
Title :
A Markov random field model for automatic speech recognition
Author :
Gravier, Guillaume ; Sigelle, Marc ; Chollet, Gérard
Author_Institution :
ENST, Paris, France
Volume :
3
fYear :
2000
fDate :
2000
Firstpage :
254
Abstract :
Speech can be represented as a time/frequency distribution of energy using a multiband filter bank. A Markov random field model, which takes into account the possible time asynchrony across the bands, is estimated for each segmental units to be recognized. The law of the speech process is given by a parametric Gibbs distribution and a maximum likelihood parameter estimation algorithm is developed. Experiments are conducted on an isolated word recognition problem. It is shown that similar performances are obtained with the new model and with standard HMM techniques in the mono-band case. In the multiband case, it is shown that modeling interband synchrony is an interesting approach to increase the performance when the number of bands increases
Keywords :
filtering theory; hidden Markov models; maximum likelihood estimation; speech recognition; time-frequency analysis; HMM techniques; Markov random field model; automatic speech recognition; interband synchrony; isolated word recognition problem; maximum likelihood parameter estimation; multiband filter bank; parametric Gibbs distribution; speech process; speech representation; time asynchrony; time/frequency energy distribution; Additive noise; Automatic speech recognition; Cepstral analysis; Filter bank; Hidden Markov models; Markov random fields; Noise robustness; Parameter estimation; Speech enhancement; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition, 2000. Proceedings. 15th International Conference on
Conference_Location :
Barcelona
ISSN :
1051-4651
Print_ISBN :
0-7695-0750-6
Type :
conf
DOI :
10.1109/ICPR.2000.903533
Filename :
903533
Link To Document :
بازگشت