Universal background model based speech recognition

Author

Povey, Daniel ; Chu, Stephen M. ; Varadarajan, Balakrishnan

Author_Institution

T.J. Watson Res. Center, IBM, Yorktown Heights, NY

fYear

2008

fDate

March 31 2008-April 4 2008

Firstpage

4561

Lastpage

4564

Abstract

The universal background model (UBM) is an effective framework widely used in speaker recognition. But so far it has received little attention from the speech recognition field. In this work, we make a first attempt to apply the UBM to acoustic modeling in ASR. We propose a tree-based parameter estimation technique for UBMs, and describe a set of smoothing and pruning methods to facilitate learning. The proposed UBM approach is benchmarked on a state-of-the-art large-vocabulary continuous speech recognition platform on a broadcast transcription task. Preliminary experiments reported in this paper already show very exciting results.

Keywords

parameter estimation; speech recognition; ASR; acoustic modeling; pruning methods; smoothing methods; speaker recognition; speech recognition; tree-based parameter estimation technique; universal background model; Automatic speech recognition; Broadcasting; Context modeling; Gaussian processes; Loudspeakers; Parameter estimation; Smoothing methods; Speaker recognition; Speech recognition; Statistics; UBM; acoustic modeling; speech recognition; universal background model;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on

Conference_Location

Las Vegas, NV

ISSN

1520-6149

Print_ISBN

978-1-4244-1483-3

Electronic_ISBN

1520-6149

Type

conf

DOI

10.1109/ICASSP.2008.4518671

Filename

4518671