• DocumentCode
    294563
  • Title

    Viterbi algorithm for acoustic vectors generated by a linear stochastic differential equation on each state

  • Author

    Saerens, Marco

  • Author_Institution
    IRIDIA Lab., Univ. Libre de Bruxelles, Belgium
  • Volume
    1
  • fYear
    1995
  • fDate
    9-12 May 1995
  • Firstpage
    233
  • Abstract
    When using hidden Markov models for speech recognition, it is usually assumed that the probability that a particular acoustic vector is emitted at a given time only depends on the current state and the current acoustic vector observed. We introduce another idea, i.e., we assume that, in a given state, the acoustic vectors are generated by a linear stochastic differential equation. This work is motivated by the fact that the time evolution of the acoustic vector is inherently dynamic and continuous. So that the modelling could be performed in the continuous-time domain instead of the discrete-time domain. By the way, the links between the discrete-time model obtained after sampling, and the original continuous-time signal are not so trivial. In particular, the relationship between the coefficients of a continuous-time linear process and the coefficients of the discrete-time linear process obtained after sampling is nonlinear. We assign a probability density to the continuous-time trajectory of the acoustic vector inside the state, reflecting the probability that this particular path has been generated by the stochastic differential equation associated with this state. This allows us to compute the likelihood of the uttered word. Reestimation formulae for the parameters of the process, based on the maximization of the likelihood, can be derived for the Viterbi algorithm. As usual, the segmentation can be obtained by sampling the continuous process, and by applying dynamic programming to find the best path over all the possible sequences of states
  • Keywords
    acoustic signal processing; difference equations; dynamic programming; feature extraction; hidden Markov models; linear differential equations; maximum likelihood estimation; signal sampling; speech processing; speech recognition; time-domain analysis; Viterbi algorithm; acoustic vectors; continuous-time domain; continuous-time linear process; continuous-time signal; continuous-time trajectory; discrete-time linear process; discrete-time model; dynamic programming; hidden Markov models; likelihood maximization; linear stochastic differential equation; probability density; sampling; speech recognition; time evolution; Acoustic emission; Differential equations; Dynamic programming; Hidden Markov models; Sampling methods; Signal sampling; Speech recognition; Stochastic processes; Vectors; Viterbi algorithm;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
  • Conference_Location
    Detroit, MI
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-2431-5
  • Type

    conf

  • DOI
    10.1109/ICASSP.1995.479407
  • Filename
    479407