• DocumentCode
    1065237
  • Title

    Acoustic-phonetic modeling in the SPICOS system

  • Author

    Ney, Hermann ; Noll, Andreas

  • Author_Institution
    Lehrstuhl fuer Inf. VI, Rheinisch-Westfalische Tech. Hochschule, Aachen, Germany
  • Volume
    2
  • Issue
    2
  • fYear
    1994
  • fDate
    4/1/1994 12:00:00 AM
  • Firstpage
    312
  • Lastpage
    319
  • Abstract
    This paper deals with the acoustic-phonetic modeling developed in the SPICOS recognition system. The phoneme units are represented by variants of hidden Markov models and are refined step-by-step to improve the recognition performance. Two different approaches to modeling the emission probabilities are investigated, namely discrete models and continuous mixture densities. For the mixture density approach, a straightforward integration into the Viterbi scoring criterion is derived. In the experimental tests, the continuous mixture density approach was found to be superior to the discrete models, given the constraints of the Viterbi framework. The authors describe the details and refinements of the two approaches
  • Keywords
    hidden Markov models; probability; speech recognition equipment; SPICOS recognition system; Viterbi scoring criterion; acoustic-phonetic modeling; continuous mixture densities; discrete models; emission probabilities; hidden Markov models; phoneme units; recognition performance; Acoustic testing; Context modeling; Gaussian distribution; Hidden Markov models; Robustness; Speech recognition; Terrorism; Training data; Viterbi algorithm; Vocabulary;
  • fLanguage
    English
  • Journal_Title
    Speech and Audio Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1063-6676
  • Type

    jour

  • DOI
    10.1109/89.279280
  • Filename
    279280