• DocumentCode
    1749674
  • Title

    Very fast adaptation with a compact context-dependent eigenvoice model

  • Author

    Kuhn, R. ; Perronnin, E. ; Nguyen, P. ; Junqua, J.C. ; Rigazio, L.

  • Author_Institution
    Panasonic Speech Technol. Lab., Panasonic Technol. Inc, Santa Barbara, CA, USA
  • Volume
    1
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    373
  • Abstract
    The "eigenvoice" technique achieves rapid speaker adaptation by employing prior knowledge of speaker space obtained from reference speakers to place strong constraints on the initial model for each new speaker. It has previously been shown to yield very fast adaptation for a large-vocabulary system. In this paper, we describe a new way of applying the eigenvoice technique to context-dependent acoustic modeling, called the "eigencentroid plus delta trees" (EDT) model. Here, the context-dependent model is defined so that it consists of a speaker-dependent component with a small number of parameters linked to a speaker-independent component with far more parameters. The eigenvoice technique can then be applied to the speaker-dependent component alone to attain very fast adaptation of the entire context-dependent model (e.g., 10% relative reduction in error rate after 3 sentences). EDT requires only a small number of parameters to represent speaker space and works even if only a small amount of data is available per reference speaker
  • Keywords
    eigenvalues and eigenfunctions; speech recognition; trees (mathematics); EDT model; compact context-dependent eigenvoice model; context-dependent acoustic modeling; eigencentroid plus delta trees; speaker adaptation; speaker space; speaker-dependent component; speaker-independent component; speech recognition; very fast adaptation; Context modeling; Error analysis; Hidden Markov models; Laboratories; Loudspeakers; Maximum likelihood estimation; Principal component analysis; Space technology; Speech recognition; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
  • Conference_Location
    Salt Lake City, UT
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7041-4
  • Type

    conf

  • DOI
    10.1109/ICASSP.2001.940845
  • Filename
    940845