• DocumentCode
    352334
  • Title

    Fast speaker adaptation of large vocabulary continuous density HMM speech recognizer using a basis transform approach

  • Author

    Boulis, Constantinos ; Digalakis, Vassilios

  • Author_Institution
    Dept. of Electron. & Comput. Eng., Tech. Univ. of Crete, Chania, Greece
  • Volume
    2
  • fYear
    2000
  • fDate
    2000
  • Abstract
    Maximum likelihood transformation-adaptation techniques have proven successful, but it is believed that faster convergence to speaker dependent (SD) performance can be achieved if we incorporate some form of a-priori knowledge in the adaptation process. In this paper, instead of estimating one linear transform per class of models for each new speaker, we transform the speaker-independent (SI) models using multiple linear transforms and a weight vector. To reduce the number of adaptation parameters, the multiple linear transforms are generated from training speakers and the adaptation parameters consist of a single weight vector per class. This can be seen as incorporating a-priori knowledge to our estimation process. Experiments conducted on the Spoken Language Translator database in the Swedish language using SRI´s DECIPHERTM system, show that the new method outperforms maximum likelihood linear regression on very limited adaptation data
  • Keywords
    hidden Markov models; maximum likelihood estimation; speech recognition; transforms; DECIPHERTM system; Spoken Language Translator database; Swedish; a-priori knowledge; adaptation process; basis transform approach; convergence; fast speaker adaptation; large vocabulary continuous density HMM speech recognizer; linear transform; maximum likelihood transformation-adaptation techniques; multiple linear transforms; speaker dependent performance; speaker-independent models; weight vector; Equations; Hidden Markov models; Maximum likelihood estimation; Maximum likelihood linear regression; Prototypes; Speech processing; Speech recognition; Transforms; Vectors; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
  • Conference_Location
    Istanbul
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-6293-4
  • Type

    conf

  • DOI
    10.1109/ICASSP.2000.859128
  • Filename
    859128