• DocumentCode
    284632
  • Title

    Predictor codebooks for speaker-independent speech recognition

  • Author

    Kawabata, Takshi

  • Author_Institution
    NTT Basic Res. Labs., Tokyo, Japan
  • Volume
    1
  • fYear
    1992
  • fDate
    23-26 Mar 1992
  • Firstpage
    353
  • Abstract
    The authors examine the speech recognition capabilities of predictor codebooks under multi-speaker and speaker-independent conditions. Three structures of spectrum predictors, a forward predictor, a backward predictor, and an interpolator, are examined. Predictor codebooks are generated by the LBG algorithm with a small modification for predictor quantization. The predictor codebooks are then tested on a phone recognition task with three different measurements. The degradation in predictor-codebook performance was reduced by one-third under speaker-independent conditions. Finally, continuous-speech recognition experiments are carried out using the predictor codebook for multi-speaker and speaker-independent conditions. The results show that the backward-predictor codebook is very effective
  • Keywords
    linear predictive coding; speech recognition; LBG algorithm; LPC, multispeaker recognition; backward predictor; continuous-speech recognition; forward predictor; interpolator; phone recognition; predictor codebooks; predictor quantization; speaker-independent speech recognition; spectrum predictors; Cepstrum; Degradation; Hidden Markov models; Laboratories; Linear predictive coding; Quantization; Speech analysis; Speech coding; Speech recognition; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
  • Conference_Location
    San Francisco, CA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-0532-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.1992.225899
  • Filename
    225899