• DocumentCode
    310576
  • Title

    Experiments in speaker normalisation and adaptation for large vocabulary speech recognition

  • Author

    Pye, D. ; Woodland, P.C.

  • Author_Institution
    Dept. of Eng., Cambridge Univ., UK
  • Volume
    2
  • fYear
    1997
  • fDate
    21-24 Apr 1997
  • Firstpage
    1047
  • Abstract
    This paper examines techniques for speaker normalisation and adaptation that are applied in training with the aim of removing some of the variability from the speaker independent models. Two techniques are examined: vocal tract normalisation (VTN) which estimates a single “vocal tract length” parameter for each speaker and then modifies the speech parameterisation accordingly and speaker adaptive training (SAT) which estimates Gaussian mean and variance parameters jointly with a speaker specific set of maximum likelihood linear regression (MLLR) based transformations. It is shown that VTN is effective for both clean speech and mismatched conditions and that the further improvements obtained by applying MLLR in testing are essentially additive. Detailed results from the use of SAT show that worthwhile improvements over using MLLR with standard speaker independent models are obtained
  • Keywords
    Gaussian processes; hidden Markov models; maximum likelihood estimation; speech recognition; Gaussian mean parameters; HMM; clean speech; large vocabulary speech recognition; maximum likelihood linear regression; mismatched conditions; parameter estimation; speaker adaptation; speaker adaptive training; speaker independent models; speaker normalisation; variance parameters; vocal tract length; vocal tract normalisation; Cepstral analysis; Cepstrum; Frequency; Maximum likelihood linear regression; Parameter estimation; Speech recognition; System testing; Training data; Vectors; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
  • Conference_Location
    Munich
  • ISSN
    1520-6149
  • Print_ISBN
    0-8186-7919-0
  • Type

    conf

  • DOI
    10.1109/ICASSP.1997.596120
  • Filename
    596120