• DocumentCode
    936932
  • Title

    Discrete utterance speech recognition without time alignment

  • Author

    Shore, John E. ; Burton, David K.

  • Volume
    29
  • Issue
    4
  • fYear
    1983
  • fDate
    7/1/1983 12:00:00 AM
  • Firstpage
    473
  • Lastpage
    491
  • Abstract
    The results of a new method are presented for discrete utterance speech recognition. The method is based on rate-distortion speech coding (speech coding by vector quantization), minimum cross-entropy pattern classification, and information-theoretic spectral distortion measures. Separate vector quantization code books are designed from training sequences for each word in the recognition vocabulary. Inputs from outside the training sequence are classified by performing vector quantization and finding the code book that achieves the lowest average distortion per speech frame. The new method obviates time alignment. It achieves 99 percent accuracy for speaker-dependent recognition of a 20 -word vocabulary that includes the ten digits, with higher accuracy for recognition of the digit subset. For speaker-independent recognition, the method achieves 88 percent accuracy for the 20 -word vocabulary and 95 percent for the digit subset. Background of the method, detailed empirical results, and an analysis of computational requirements are presented.
  • Keywords
    Rate-distortion theory; Speech coding; Speech recognition; Books; Distortion measurement; Information theory; Nonlinear distortion; Pattern classification; Pattern recognition; Speech coding; Speech recognition; Vector quantization; Vocabulary;
  • fLanguage
    English
  • Journal_Title
    Information Theory, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9448
  • Type

    jour

  • DOI
    10.1109/TIT.1983.1056716
  • Filename
    1056716