• DocumentCode
    3229987
  • Title

    An LVQ based reference model for speaker-adaptive speech recognition

  • Author

    Schmidbauer, O. ; Tebelskis, J.

  • Author_Institution
    Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
  • Volume
    1
  • fYear
    1992
  • fDate
    23-26 Mar 1992
  • Firstpage
    441
  • Abstract
    A novel type of hierarchical phoneme model for speaker adaptation, based on both hidden Markov models (HMM) and learned vector quantization (LVQ) networks is presented. Low-level tied LVQ phoneme models are trained speaker-dependently and independently, yielding a pool of speaker-biased phoneme models which can be mixed into high-level speaker-adaptive phoneme models. Rapid speaker adaptation is performed by finding an optimal mixture for these models at recognition time, given only a small amount of speech data; subsequently, the models are fine-tuned to the new speaker´s voice by further parameter reestimation. In preliminary experiments with a continuous speech task using 40 context-free phoneme models at task perplexity 111, the authors achieved 82% word accuracy for speaker-dependent recognition and 73% in the speaker-adaptive mode
  • Keywords
    hidden Markov models; speech recognition; vector quantisation; HMM; context-free phoneme models; continuous speech task; hidden Markov models; hierarchical phoneme model; high-level speaker-adaptive phoneme models; learned vector quantization; optimal mixture; parameter reestimation; perplexity; recognition time; speaker-adaptive speech recognition; speaker-biased phoneme models; speaker-dependent training; speaker-independent training; speech data; tied LVQ phoneme models; Adaptation model; Computer science; Context modeling; Hidden Markov models; Neural networks; Probability distribution; Speech recognition; Statistical analysis; System testing; Vector quantization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
  • Conference_Location
    San Francisco, CA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-0532-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.1992.225877
  • Filename
    225877