DocumentCode
3424829
Title
Universal background model based speech recognition
Author
Povey, Daniel ; Chu, Stephen M. ; Varadarajan, Balakrishnan
Author_Institution
T.J. Watson Res. Center, IBM, Yorktown Heights, NY
fYear
2008
fDate
March 31 2008-April 4 2008
Firstpage
4561
Lastpage
4564
Abstract
The universal background model (UBM) is an effective framework widely used in speaker recognition. But so far it has received little attention from the speech recognition field. In this work, we make a first attempt to apply the UBM to acoustic modeling in ASR. We propose a tree-based parameter estimation technique for UBMs, and describe a set of smoothing and pruning methods to facilitate learning. The proposed UBM approach is benchmarked on a state-of-the-art large-vocabulary continuous speech recognition platform on a broadcast transcription task. Preliminary experiments reported in this paper already show very exciting results.
Keywords
parameter estimation; speech recognition; ASR; acoustic modeling; pruning methods; smoothing methods; speaker recognition; speech recognition; tree-based parameter estimation technique; universal background model; Automatic speech recognition; Broadcasting; Context modeling; Gaussian processes; Loudspeakers; Parameter estimation; Smoothing methods; Speaker recognition; Speech recognition; Statistics; UBM; acoustic modeling; speech recognition; universal background model;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location
Las Vegas, NV
ISSN
1520-6149
Print_ISBN
978-1-4244-1483-3
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2008.4518671
Filename
4518671
Link To Document