A Multi-Resolution Hidden Markov Model Using Class-Specific Features

Author

Baggenstoss, Paul M.

Author_Institution

Naval Undersea Warfare Center, Newport, RI, USA

Volume

58

Issue

10

fYear

2010

Firstpage

5165

Lastpage

5177

Abstract

We apply the PDF projection theorem to generalize the hidden Markov model (HMM) to accommodate multiple simultaneous segmentations of the raw data and multiple feature extraction transformations. Different segment sizes and feature transformations are assigned to each state. The algorithm averages over all allowable segmentations by mapping the segmentations to a “proxy” HMM and using the forward procedure. A by-product of the algorithm is the set of a posteriori state probability estimates that serve as a description of the input data. These probabilities have simultaneously the temporal resolution of the smallest processing windows and the processing gain and frequency resolution of the largest processing windows. The method is demonstrated on the problem of precisely modeling the consonant “T” in order to detect the presence of a distinct “burst” component. We compare the algorithm against standard speech analysis methods using data from the TIMIT corpus.

Keywords

hidden Markov models; probability; speech processing; PDF projection theorem; a posteriori state probability estimates; class-specific features; feature transformation; frequency resolution; multiple feature extraction; multiple simultaneous segmentation; multiresolution hidden Markov model; processing gain; temporal resolution; Automatic speech recognition; Cepstral analysis; Digital signal processing; Frequency; Hidden Markov models; Permission; Signal processing; Speech analysis; Speech processing; Wavelet packets; Markov processes; speech processing; time series analysis;

fLanguage

English

Journal_Title

Signal Processing, IEEE Transactions on

Publisher

ieee

ISSN

1053-587X

Type

jour

DOI

10.1109/TSP.2010.2052458

Filename

5483088