مرکز منطقه ای اطلاع رساني علوم و فناوري - Automatic speech recognition via pseudo-independent marginal mixtures

DocumentCode :

3013024

Title :

Automatic speech recognition via pseudo-independent marginal mixtures

Author :

Nadas, Andras ; Nahamoo, David

Author_Institution :

IBM T. J. Watson Research Center, Yorktown Heights, NY

Volume :

fYear :

1987

fDate :

31868

Firstpage :

1285

Lastpage :

1287

Abstract :

Statistical models (prototypes) for the multivariate probability distribution of vectors (frames) of speech parameters may be utilized in various ways. If the stream of vectors is passed directly to the decoder of a continuous parameter speech recognizer then the prototypes are used by the decoder; if the recognizer has a time-synchronous labeling acoustic processor then they are used for vector quantization (labeling) and the resulting label stream is passed to the decoder; other uses are possible as well. We present a method for constructing such prototypes. This method was chosen as a compromise between describing a prototype in an assumption free way as a nonparametric density and describing it in a convenient way as a simple multivariate Gaussian density. We describe speech recognition experiments where our prototypes were trained by iteratively interleaving steps of a K-MEANS type algorithm for clustering and steps of an EM algorithm for reestimation. We present results (using a labeling acoustic processor) having significantly fewer decoding errors than our previous methods do.

Keywords :

Automatic speech recognition; Clustering algorithms; Decoding; Iterative algorithms; Labeling; Probability distribution; Prototypes; Speech processing; Speech recognition; Vector quantization;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '87.

Type :

conf

DOI :

10.1109/ICASSP.1987.1169454

Filename :

1169454

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3013024