مرکز منطقه ای اطلاع رساني علوم و فناوري - Tied mixture continuous parameter modeling for speech recognition

DocumentCode :

1449905

Title :

Tied mixture continuous parameter modeling for speech recognition

Author :

Bellegarda, Jerome R. ; Nahamoo, David

Author_Institution :

IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA

Volume :

Issue :

fYear :

1990

fDate :

12/1/1990 12:00:00 AM

Firstpage :

2033

Lastpage :

2045

Abstract :

The acoustic-modeling problem in automatic speech recognition is examined with the goal of unifying discrete and continuous parameter approaches. To model a sequence of information-bearing acoustic feature vectors which has been extracted from the speech waveform via some appropriate front-end signal processing, a speech recognizer basically faces two alternatives: (1) assign a multivariate probability distribution directly to the stream of vectors, or (2) use a time-synchronous labeling acoustic processor to perform vector quantization on this stream, and assign a multinomial probability distribution to the output of the vector quantizer. With a few exceptions, these two methods have traditionally been given separate treatment. A class of very general hidden Markov models which can accommodate feature vector sequences lying either in a discrete or in a continuous space is considered; the new class allows one to represent the prototypes in an assumption-limited, yet convenient way, as tied mixtures of simple multivariate densities. Speech recognition experiments, reported for two (5000- and 20000-word vocabulary) office correspondence tasks, demonstrate some of the benefits associated with this technique

Keywords :

acoustic signal processing; speech analysis and processing; speech recognition; acoustic feature vectors; acoustic-modeling problem; automatic speech recognition; continuous parameter modeling; discrete modelling; front-end signal processing; hidden Markov models; multinomial probability distribution; multivariate densities; multivariate probability distribution; office correspondence tasks; speech recognizer; speech waveform; tied mixtures; time-synchronous labeling acoustic processor; vector quantization; vector quantizer; vocabulary; Acoustic signal processing; Acoustic waves; Automatic speech recognition; Face recognition; Hidden Markov models; Labeling; Probability distribution; Speech processing; Speech recognition; Vector quantization;

fLanguage :

English

Journal_Title :

Acoustics, Speech and Signal Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

0096-3518

Type :

jour

DOI :

10.1109/29.61531

Filename :

61531

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1449905