DocumentCode
1513347
Title
A Multi-Resolution Hidden Markov Model Using Class-Specific Features
Author
Baggenstoss, Paul M.
Author_Institution
Naval Undersea Warfare Center, Newport, RI, USA
Volume
58
Issue
10
fYear
2010
Firstpage
5165
Lastpage
5177
Abstract
We apply the PDF projection theorem to generalize the hidden Markov model (HMM) to accommodate multiple simultaneous segmentations of the raw data and multiple feature extraction transformations. Different segment sizes and feature transformations are assigned to each state. The algorithm averages over all allowable segmentations by mapping the segmentations to a “proxy” HMM and using the forward procedure. A by-product of the algorithm is the set of a posteriori state probability estimates that serve as a description of the input data. These probabilities have simultaneously the temporal resolution of the smallest processing windows and the processing gain and frequency resolution of the largest processing windows. The method is demonstrated on the problem of precisely modeling the consonant “T” in order to detect the presence of a distinct “burst” component. We compare the algorithm against standard speech analysis methods using data from the TIMIT corpus.
Keywords
hidden Markov models; probability; speech processing; PDF projection theorem; a posteriori state probability estimates; class-specific features; feature transformation; frequency resolution; multiple feature extraction; multiple simultaneous segmentation; multiresolution hidden Markov model; processing gain; temporal resolution; Automatic speech recognition; Cepstral analysis; Digital signal processing; Frequency; Hidden Markov models; Permission; Signal processing; Speech analysis; Speech processing; Wavelet packets; Markov processes; speech processing; time series analysis;
fLanguage
English
Journal_Title
Signal Processing, IEEE Transactions on
Publisher
ieee
ISSN
1053-587X
Type
jour
DOI
10.1109/TSP.2010.2052458
Filename
5483088
Link To Document