• DocumentCode
    294656
  • Title

    Experiments using data augmentation for speaker adaptation

  • Author

    Bellegarda, Jerome R. ; De Souza, Peter V. ; Nahamoo, David ; Padmanabhan, Mukund ; Picheny, Michael A. ; Bahl, Lalit R.

  • Author_Institution
    IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
  • Volume
    1
  • fYear
    1995
  • fDate
    9-12 May 1995
  • Firstpage
    692
  • Abstract
    Speaker adaptation typically involves customizing some existing (reference) models in order to account for the characteristics of a new speaker. This work considers the slightly different paradigm of customizing some reference data for the purpose of populating the new speaker's space, and then using the resulting (augmented) data to derive the customized models. The data augmentation technique is based on the metamorphic algorithm first proposed in Bellegarda et al. [1992], assuming that a relatively modest amount of data (100 sentences) is available from each new speaker. This constraint requires that reference speakers be selected with some care. The performance of this method is illustrated on a portion of the Wall Street Journal task.
  • Keywords
    natural languages; speech recognition; Wall Street Journal task; customized models; data augmentation; metamorphic algorithm; reference data; speaker adaptation; Error analysis; Hidden Markov models; Loudspeakers; Natural languages; Prototypes; Speech analysis; Speech recognition; Switches; Testing; Training data; Vocabulary
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
  • Conference_Location
    Detroit, MI
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-2431-5
  • Type

    conf

  • DOI
    10.1109/ICASSP.1995.479788
  • Filename
    479788