Structured Markov models for speech recognition

Author

Wolfertstetter, F. ; Ruske, G.

Author_Institution

Inst. for Human-Machine-Commun., Munich Univ. of Technol., Germany

Volume

1

fYear

1995

fDate

9-12 May 1995

Firstpage

544

Abstract

This paper proposes a new modeling of the structure of speech units as a graph consisting of base functions and a transition network. A cluster algorithm taking into account the actual temporal context of the feature vectors is used to generate the base functions, which are approximated by normal distributions. The subsequent Viterbi-based maximum-likelihood training procedure establishes the transition network and adjusts the transition probabilities. The emerging graphs for the speech units are a structure of branching and recombining trajectory segments describing statistical dependencies in the feature vector sequence within the speech units as well as in the transition regions between them. A speaker-independent evaluation shows the superiority of the proposed modeling compared to mixture-state HMMs, even for an equal number of model parameters

Keywords

Markov processes; graph theory; maximum likelihood estimation; normal distribution; sequences; speech processing; speech recognition; Viterbi-based maximum-likelihood training; base functions; branching trajectory segments; cluster algorithm; feature vector sequence; feature vectors; graph; mixture-state HMM; model parameters; normal distributions; recombining trajectory segments; speaker-independent evaluation; speech recognition; speech units; statistical dependencies; structured Markov models; temporal context; transition network; transition probabilities; transition regions; Clustering algorithms; Electronic mail; Gaussian distribution; Hidden Markov models; Leg; Loudspeakers; Probability density function; Prototypes; Speech recognition; Vectors; Viterbi algorithm;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on

Conference_Location

Detroit, MI

ISSN

1520-6149

Print_ISBN

0-7803-2431-5

Type

conf

DOI

10.1109/ICASSP.1995.479649

Filename

479649