Title :
Universal background model based speech recognition
Author :
Povey, Daniel ; Chu, Stephen M. ; Varadarajan, Balakrishnan
Author_Institution :
T.J. Watson Res. Center, IBM, Yorktown Heights, NY
fDate :
March 31 2008-April 4 2008
Abstract :
The universal background model (UBM) is an effective framework widely used in speaker recognition. But so far it has received little attention from the speech recognition field. In this work, we make a first attempt to apply the UBM to acoustic modeling in ASR. We propose a tree-based parameter estimation technique for UBMs, and describe a set of smoothing and pruning methods to facilitate learning. The proposed UBM approach is benchmarked on a state-of-the-art large-vocabulary continuous speech recognition platform on a broadcast transcription task. Preliminary experiments reported in this paper already show very exciting results.
Keywords :
parameter estimation; speech recognition; ASR; acoustic modeling; pruning methods; smoothing methods; speaker recognition; speech recognition; tree-based parameter estimation technique; universal background model; Automatic speech recognition; Broadcasting; Context modeling; Gaussian processes; Loudspeakers; Parameter estimation; Smoothing methods; Speaker recognition; Speech recognition; Statistics; UBM; acoustic modeling; speech recognition; universal background model;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2008.4518671