DocumentCode
294656
Title
Experiments using data augmentation for speaker adaptation
Author
Bellegarda, Jerome R. ; De Souza, Peter V. ; Nahamoo, David ; Padmanabhan, Mukund ; Picheny, Michael A. ; Bahl, Lalit R.
Author_Institution
IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
Volume
1
fYear
1995
fDate
9-12 May 1995
Firstpage
692
Abstract
Speaker adaptation typically involves customizing some existing (reference) models in order to account for the characteristics of a new speaker. This work considers the slightly different paradigm of customizing some reference data for the purpose of populating the new speaker´s space, and then using the resulting (augmented) data to derive the customized models. The data augmentation technique is based on the metamorphic algorithm first proposed in Bellegarda et al. [1992], assuming that a relatively modest amount of data (100 sentences) is available from each new speaker. This contraint requires that reference speakers be selected with some care. The performance of this method is illustrated on a portion of the Wall Street Journal task
Keywords
natural languages; speech recognition; Wall Street Journal task; customized models; data augmentation; metamorphic algorithm; reference data; speaker adaptation; Error analysis; Hidden Markov models; Loudspeakers; Natural languages; Prototypes; Speech analysis; Speech recognition; Switches; Testing; Training data; Vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location
Detroit, MI
ISSN
1520-6149
Print_ISBN
0-7803-2431-5
Type
conf
DOI
10.1109/ICASSP.1995.479788
Filename
479788
Link To Document