Viterbi algorithm for acoustic vectors generated by a linear stochastic differential equation on each state

Author

Saerens, Marco

Author_Institution

IRIDIA Lab., Univ. Libre de Bruxelles, Belgium

Volume

1

fYear

1995

fDate

9-12 May 1995

Firstpage

233

Abstract

When using hidden Markov models for speech recognition, it is usually assumed that the probability that a particular acoustic vector is emitted at a given time only depends on the current state and the current acoustic vector observed. We introduce another idea, i.e., we assume that, in a given state, the acoustic vectors are generated by a linear stochastic differential equation. This work is motivated by the fact that the time evolution of the acoustic vector is inherently dynamic and continuous. So that the modelling could be performed in the continuous-time domain instead of the discrete-time domain. By the way, the links between the discrete-time model obtained after sampling, and the original continuous-time signal are not so trivial. In particular, the relationship between the coefficients of a continuous-time linear process and the coefficients of the discrete-time linear process obtained after sampling is nonlinear. We assign a probability density to the continuous-time trajectory of the acoustic vector inside the state, reflecting the probability that this particular path has been generated by the stochastic differential equation associated with this state. This allows us to compute the likelihood of the uttered word. Reestimation formulae for the parameters of the process, based on the maximization of the likelihood, can be derived for the Viterbi algorithm. As usual, the segmentation can be obtained by sampling the continuous process, and by applying dynamic programming to find the best path over all the possible sequences of states

Keywords

acoustic signal processing; difference equations; dynamic programming; feature extraction; hidden Markov models; linear differential equations; maximum likelihood estimation; signal sampling; speech processing; speech recognition; time-domain analysis; Viterbi algorithm; acoustic vectors; continuous-time domain; continuous-time linear process; continuous-time signal; continuous-time trajectory; discrete-time linear process; discrete-time model; dynamic programming; hidden Markov models; likelihood maximization; linear stochastic differential equation; probability density; sampling; speech recognition; time evolution; Acoustic emission; Differential equations; Dynamic programming; Hidden Markov models; Sampling methods; Signal sampling; Speech recognition; Stochastic processes; Vectors; Viterbi algorithm;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on

Conference_Location

Detroit, MI

ISSN

1520-6149

Print_ISBN

0-7803-2431-5

Type

conf

DOI

10.1109/ICASSP.1995.479407

Filename

479407