• DocumentCode
    337475
  • Title

    Fast speaker adaptation using a priori knowledge

  • Author

    Kuhn, R. ; Nguyen, R. ; Junqua, J.C. ; Boman, R. ; Niedzielski, N. ; Fincke, S. ; Field, K. ; Contolini, M.

  • Author_Institution
    Speech Technol. Lab., Panasonic Technol. Inc., Santa Barbara, CA, USA
  • Volume
    2
  • fYear
    1999
  • fDate
    15-19 Mar 1999
  • Firstpage
    749
  • Abstract
    Previously, we presented a radically new class of fast adaptation techniques for speech recognition, based on prior knowledge of speaker variation. To obtain this prior knowledge, one applies a dimensionality reduction technique to T vectors of dimension D derived from T speaker-dependent (SD) models. This offline step yields T basis vectors, the eigenvoices. We constrain the model for new speaker S to be located in the space spanned by the first K eigenvoices. Speaker adaptation involves estimating K eigenvoice coefficients for the new speaker; typically, K is very small compared to original dimension D. Here, we review how to find the eigenvoices, give a maximum-likelihood estimator for the new speaker´s eigenvoice coefficients, and summarize mean adaptation experiments carried out on the Isolet database. We present new results which assess the impact on performance of changes in training of the SD models. Finally, we interpret the first few eigenvoices obtained
  • Keywords
    adaptive signal processing; maximum likelihood estimation; principal component analysis; speech recognition; Isolet database; a priori knowledge; basis vectors; dimensionality reduction technique; eigenvoice coefficients; fast speaker adaptation; maximum-likelihood estimator; mean adaptation experiments; offline step; performance; principal component analysis; speaker variation; speaker-dependent models; speech recognition; training; vector dimension; Databases; Hidden Markov models; Independent component analysis; Laboratories; Loudspeakers; Maximum likelihood estimation; Principal component analysis; Speech recognition; Surfaces; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
  • Conference_Location
    Phoenix, AZ
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-5041-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.1999.759776
  • Filename
    759776