• DocumentCode
    3008295
  • Title

    Markov modeling of continuous parameters in speech recognition

  • Author

    Soudoplatoff, Serge

  • Author_Institution
    IBM France Scientific Center, Paris, France
  • Volume
    11
  • fYear
    1986
  • fDate
    31503
  • Firstpage
    45
  • Lastpage
    48
  • Abstract
    This paper shows how to avoid the labelling stage of a speech recognition strategy based on hidden Markov models while keeping a stochastic formulation. After a brief review of how a Markov model can be used for speech recognition, we propose another formulation in which the labels are suppressed and only continuous parameters are dealt with. The notion of a speech generator is then introduced, and the formulas for training as well as decoding are rewritten. This new formulation requires estimating the probability densities p(x | G), where G is a generator and x an acoustic vector. We explain our choice of non-parametric methods, using Parzen estimators. These estimators require a kernel function, which we choose in a simple manner, and a value for the radius of the kernel, which is the key problem. A statistical solution, an information-theoretic solution, and an original topological solution are presented in turn, the last being retained. Finally, we present the results of applying this model to a 5000-word speech recognition system. The results show that the error rate can be decreased by switching from a simple labelling scheme to this continuous-parameter model.
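    As a rough illustration of the Parzen estimate of p(x | G) mentioned in the abstract, the sketch below averages a kernel centred on each training vector attributed to a generator G. The abstract does not say which kernel or radius rule the paper uses, so the isotropic Gaussian kernel, the fixed `radius` argument, and the names `parzen_density`, `samples_a`, and `samples_b` are assumptions made for this example only; it does not implement the topological radius selection the paper retains.

```python
import math


def parzen_density(x, samples, radius):
    """Parzen estimate of p(x | G) from vectors observed for generator G.

    Assumption: an isotropic Gaussian kernel of width `radius`; the
    paper's actual kernel and radius rule are not given in the abstract.
    """
    if not samples:
        raise ValueError("at least one training vector is required")
    if radius <= 0:
        raise ValueError("radius must be positive")

    dim = len(x)
    # Normalising constant of a dim-dimensional isotropic Gaussian.
    norm = (2.0 * math.pi * radius ** 2) ** (dim / 2.0)

    total = 0.0
    for s in samples:
        sq_dist = sum((xi - si) ** 2 for xi, si in zip(x, s))
        total += math.exp(-sq_dist / (2.0 * radius ** 2))

    # Average of the kernels centred on each training vector.
    return total / (len(samples) * norm)


# Hypothetical training vectors for two generators (illustrative data only).
samples_a = [[0.0, 0.0], [0.2, -0.1]]
samples_b = [[3.0, 3.0], [2.8, 3.1]]
x = [0.1, 0.0]

density_a = parzen_density(x, samples_a, radius=0.5)
density_b = parzen_density(x, samples_b, radius=0.5)
print("more likely generator:", "A" if density_a > density_b else "B")
```

    The `radius` argument is the quantity the abstract calls the key problem: too small a value makes the estimate spiky around the training vectors, while too large a value blurs the generators together.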
  • Keywords
    Decoding; Hidden Markov models; Information theory; Kernel; Labeling; Markov processes; Speech recognition; Stochastic processes; Switches; Vocabulary;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '86.
  • Type

    conf

  • DOI
    10.1109/ICASSP.1986.1169180
  • Filename
    1169180