مرکز منطقه ای اطلاع رساني علوم و فناوري - Discrete utterance speech recognition without time alignment

DocumentCode :

936932

Title :

Discrete utterance speech recognition without time alignment

Author :

Shore, John E. ; Burton, David K.

Volume :

Issue :

fYear :

1983

fDate :

7/1/1983 12:00:00 AM

Firstpage :

473

Lastpage :

491

Abstract :

The results of a new method are presented for discrete utterance speech recognition. The method is based on rate-distortion speech coding (speech coding by vector quantization), minimum cross-entropy pattern classification, and information-theoretic spectral distortion measures. Separate vector quantization code books are designed from training sequences for each word in the recognition vocabulary. Inputs from outside the training sequence are classified by performing vector quantization and finding the code book that achieves the lowest average distortion per speech frame. The new method obviates time alignment. It achieves $99$ percent accuracy for speaker-dependent recognition of a $20$ -word vocabulary that includes the ten digits, with higher accuracy for recognition of the digit subset. For speaker-independent recognition, the method achieves $88$ percent accuracy for the $20$ -word vocabulary and $95$ percent for the digit subset. Background of the method, detailed empirical results, and an analysis of computational requirements are presented.

Keywords :

Rate-distortion theory; Speech coding; Speech recognition; Books; Distortion measurement; Information theory; Nonlinear distortion; Pattern classification; Pattern recognition; Speech coding; Speech recognition; Vector quantization; Vocabulary;

fLanguage :

English

Journal_Title :

Information Theory, IEEE Transactions on

Publisher :

ieee

ISSN :

0018-9448

Type :

jour

DOI :

10.1109/TIT.1983.1056716

Filename :

1056716

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=936932