DocumentCode :
936932
Title :
Discrete utterance speech recognition without time alignment
Author :
Shore, John E. ; Burton, David K.
Volume :
29
Issue :
4
fYear :
1983
fDate :
7/1/1983 12:00:00 AM
Firstpage :
473
Lastpage :
491
Abstract :
The results of a new method are presented for discrete utterance speech recognition. The method is based on rate-distortion speech coding (speech coding by vector quantization), minimum cross-entropy pattern classification, and information-theoretic spectral distortion measures. Separate vector quantization code books are designed from training sequences for each word in the recognition vocabulary. Inputs from outside the training sequence are classified by performing vector quantization and finding the code book that achieves the lowest average distortion per speech frame. The new method obviates time alignment. It achieves 99 percent accuracy for speaker-dependent recognition of a 20 -word vocabulary that includes the ten digits, with higher accuracy for recognition of the digit subset. For speaker-independent recognition, the method achieves 88 percent accuracy for the 20 -word vocabulary and 95 percent for the digit subset. Background of the method, detailed empirical results, and an analysis of computational requirements are presented.
Keywords :
Rate-distortion theory; Speech coding; Speech recognition; Books; Distortion measurement; Information theory; Nonlinear distortion; Pattern classification; Pattern recognition; Speech coding; Speech recognition; Vector quantization; Vocabulary;
fLanguage :
English
Journal_Title :
Information Theory, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9448
Type :
jour
DOI :
10.1109/TIT.1983.1056716
Filename :
1056716
Link To Document :
بازگشت