• DocumentCode
    302327
  • Title

    A semi-continuous stochastic trajectory model for phoneme-based continuous speech recognition

  • Author

    Siohan, Olivier ; Gong, Yifan

  • Author_Institution
    CRIN, Vandoeuvre les Nancy, France
  • Volume
    1
  • fYear
    1996
  • fDate
    7-10 May 1996
  • Firstpage
    471
  • Abstract
    We propose a model of phoneme-based speech unit, called semi-continuous stochastic trajectory model (SC-STM), which generalizes our stochastic trajectory models (STM). As STMs, the SC-STMs focus on the modeling of speech segments (called trajectories) in their parameter space, and can therefore handle segmental information, which is critical for large vocabulary continuous speech recognition. Compared to the STMs, the SC-STMs improve the resolution of the trajectory modeling, while keeping a moderate number of free parameters by sharing state probability density functions. The SC-STM can therefore maintain a good trade-off between detailed acoustic modeling and limited training data. We tested the idea on a 2010 words, speaker-dependent, continuous speech database. Preliminary results show that SC-STM gives a word accuracy close to that of STM, without using heuristic techniques that enhanced STM
  • Keywords
    parameter estimation; probability; speech processing; speech recognition; stochastic processes; acoustic modeling; large vocabulary continuous speech recognition; limited training data; parameter space; phoneme based continuous speech recognition; phoneme based speech unit; segmental information; semicontinuous stochastic trajectory model; speaker dependent continuous speech database; speech modeling; speech segments; state probability density functions; trajectory modeling resolution; word accuracy; Acoustic testing; Context modeling; Databases; Hidden Markov models; Probability density function; Solid modeling; Speech recognition; Stochastic processes; Training data; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
  • Conference_Location
    Atlanta, GA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-3192-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.1996.541135
  • Filename
    541135