Title :
Combined-order hidden Markov models for reverberation-robust speech recognition
Author :
Maas, Roland ; Kotha, Sujan R. ; Sehr, Armin ; Kellermann, Walter
Author_Institution :
Multimedia Commun. & Signal Process., Univ. of Erlangen-Nuremberg Erlangen, Erlangen, Germany
Abstract :
In this contribution, the concept of combined-order hidden Markov models (CO-HMMs) is introduced by joining the first-order Markov and the second-order conditional independence assumption. The proposed approach is motivated and evaluated in the context of reverberation-robust automatic speech recognition. Two predecessor-dependent output probability density functions per hidden Markov model (HMM) state are employed in order to explicitly cope with the high inter-frame correlation in presence of reverberation. At the same time, the state duration modeling related to the first-order Markov assumption is addressed by a recently published training procedure based on hard alignment having the significant advantage that any conventional HMM can be efficiently updated to a CO-HMM. The experimental results show a reduction in average entropy as well as in word error rate in reverberant environments compared to conventional HMMs.
Keywords :
hidden Markov models; speech recognition; CO-HMM; combined order hidden Markov models; first-order Markov assumption; output probability density functions; reverberation robust speech recognition; second order conditional independence assumption; state duration modeling; Entropy; Hidden Markov models; Markov processes; Reverberation; Speech; Training; Vectors;
Conference_Titel :
Cognitive Information Processing (CIP), 2012 3rd International Workshop on
Conference_Location :
Baiona
Print_ISBN :
978-1-4673-1877-8
DOI :
10.1109/CIP.2012.6232918