DocumentCode :
3347807
Title :
A multimodal variational approach to learning and inference in switching state space models [speech processing application]
Author :
Lee, Leo J. ; Attias, Hagai ; Deng, Li ; Fieguth, Paul
Author_Institution :
Dept. of Electr. & Comput. Eng., Waterloo Univ., Ont., Canada
Volume :
5
fYear :
2004
fDate :
17-21 May 2004
Abstract :
An important general model for discrete-time signal processing is the switching state space (SSS) model, which generalizes the hidden Markov model and the Gaussian state space model. Inference and parameter estimation in this model are known to be computationally intractable. This paper presents a powerful new approximation to the SSS model. The approximation is based on a variational technique that preserves the multimodal nature of the continuous state posterior distribution. Furthermore, by incorporating a windowing technique, the resulting EM algorithm has complexity that is just linear in the length of the time series. An alternative Viterbi decoding with frame-based likelihood is also presented which is crucial for the speech application that originally motivates this work. Our experiments focus on demonstrating the effectiveness of the algorithm by extensive simulations. A typical example in speech processing is also included to show the potential of this approach for practical applications.
Keywords :
Gaussian processes; Viterbi decoding; discrete time systems; frame based representation; hidden Markov models; learning (artificial intelligence); model-based reasoning; parameter estimation; speech processing; state-space methods; time series; variational techniques; EM algorithm; Gaussian state space model; SSS model approximation; continuous state posterior distribution; discrete-time signal processing; frame-based likelihood Viterbi decoding; hidden Markov model; inference; learning; multimodal variational technique; parameter estimation; speech processing; switching state space models; time series; windowing technique; Design engineering; Hidden Markov models; Parameter estimation; Power system modeling; Signal processing; Signal processing algorithms; Speech processing; State-space methods; Systems engineering and theory; Viterbi algorithm;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-8484-9
Type :
conf
DOI :
10.1109/ICASSP.2004.1327158
Filename :
1327158
Link To Document :
بازگشت