Title :
A comparison of several variants of GMM on speech recognition task
Author :
Jakovljevic, Niksa M. ; Miskovic, Dragisa ; Pakoci, Edvin ; Grbic, Tatjana ; Delic, Vlado
Abstract :
In the automatic speech recognition task, the dominant approach is the statistical framework based on hidden Markov models in combination with Gaussian mixture models. The issues which should be solved are: how to obtain a statistically efficient estimation of model parameters, especially covariance matrix, whose number of parameters is proportional to the square of the dimensionality of the feature space, as well as sufficiently fast and accurate evaluation of observation emission probabilities. This paper provides the evaluation results of several models (diagonal approximation, maximum likelihood linear transformation, semi tied covariance) tested on Serbian SpeechDat corpus. In these experiments, the model with full covariance matrices has achieved the best performance (in terms of accuracy and computation time).
Keywords :
Gaussian processes; approximation theory; covariance matrices; hidden Markov models; maximum likelihood estimation; mixture models; speech recognition; GMM; Gaussian mixture models; Serbian SpeechDat corpus; automatic speech recognition task; diagonal approximation; feature dimensionality; full covariance matrix; hidden Markov models; maximum likelihood linear transformation; model parameter estimation; observation emission probabilities; semi tied covariance; statistical framework; Automatic speech recognition; Biological system modeling; Covariance matrices; Electronic mail; Gaussian mixture model; Hidden Markov models; Speech;
Conference_Titel :
Telecommunications Forum (TELFOR), 2013 21st
Conference_Location :
Belgrade
Print_ISBN :
978-1-4799-1419-7
DOI :
10.1109/TELFOR.2013.6716268