Title :
Diagonal priors for full covariance speech recognition
Author :
Bell, Peter ; King, Simon
Author_Institution :
Centre for Speech Technol. Res., Univ. of Edinburgh, Edinburgh, UK
fDate :
Nov. 13 2009-Dec. 17 2009
Abstract :
We investigate the use of full covariance Gaussians for large-vocabulary speech recognition. The large number of parameters gives high modelling power, but when training data is limited, the standard sample covariance matrix is often poorly conditioned, and has high variance. We explain how these problems may be solved by the use of a diagonal covariance smoothing prior, and relate this to the shrinkage estimator, for which the optimal shrinkage parameter may itself be estimated from the training data. We also compare the use of generatively and discriminatively trained priors. Results are presented on a large vocabulary conversational telephone speech recognition task.
Keywords :
Gaussian processes; covariance matrices; smoothing methods; speech recognition; vocabulary; diagonal covariance smoothing method; full covariance Gaussian method; full covariance speech recognition; large-vocabulary speech recognition; shrinkage estimator; standard sample covariance matrix; telephone speech recognition task; Automatic speech recognition; Covariance matrix; Gaussian processes; Informatics; Smoothing methods; Speech recognition; Telephony; Training data; Unsolicited electronic mail; Vocabulary;
Conference_Titel :
Automatic Speech Recognition & Understanding, 2009. ASRU 2009. IEEE Workshop on
Conference_Location :
Merano
Print_ISBN :
978-1-4244-5478-5
Electronic_ISBN :
978-1-4244-5479-2
DOI :
10.1109/ASRU.2009.5373344