• DocumentCode
    294631
  • Title

    Structured Markov models for speech recognition

  • Author

    Wolfertstetter, F. ; Ruske, G.

  • Author_Institution
    Inst. for Human-Machine-Commun., Munich Univ. of Technol., Germany
  • Volume
    1
  • fYear
    1995
  • fDate
    9-12 May 1995
  • Firstpage
    544
  • Abstract
    This paper proposes a new modeling of the structure of speech units as a graph consisting of base functions and a transition network. A cluster algorithm taking into account the actual temporal context of the feature vectors is used to generate the base functions, which are approximated by normal distributions. The subsequent Viterbi-based maximum-likelihood training procedure establishes the transition network and adjusts the transition probabilities. The emerging graphs for the speech units are a structure of branching and recombining trajectory segments describing statistical dependencies in the feature vector sequence within the speech units as well as in the transition regions between them. A speaker-independent evaluation shows the superiority of the proposed modeling compared to mixture-state HMMs, even for an equal number of model parameters
  • Keywords
    Markov processes; graph theory; maximum likelihood estimation; normal distribution; sequences; speech processing; speech recognition; Viterbi-based maximum-likelihood training; base functions; branching trajectory segments; cluster algorithm; feature vector sequence; feature vectors; graph; mixture-state HMM; model parameters; normal distributions; recombining trajectory segments; speaker-independent evaluation; speech recognition; speech units; statistical dependencies; structured Markov models; temporal context; transition network; transition probabilities; transition regions; Clustering algorithms; Electronic mail; Gaussian distribution; Hidden Markov models; Leg; Loudspeakers; Probability density function; Prototypes; Speech recognition; Vectors; Viterbi algorithm;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
  • Conference_Location
    Detroit, MI
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-2431-5
  • Type

    conf

  • DOI
    10.1109/ICASSP.1995.479649
  • Filename
    479649