DocumentCode :
2279343
Title :
Speech recognition using advanced HMM2 features
Author :
Weber, Katrin ; Bengio, Samy ; Bourlard, Hervé
Author_Institution :
IDIAP - Dalle Molle Inst. of Perceptual Artificial Intelligence, Martigny, Switzerland
fYear :
2001
fDate :
2001
Firstpage :
65
Lastpage :
68
Abstract :
HMM2 is a particular hidden Markov model where state emission probabilities of the temporal (primary) HMM are modeled through (secondary) state-dependent frequency-based HMMs (see Weber, K. et al., Proc. ICSGP, vol.III, p.147-50, 2000). As we show in another paper (see Weber et al., Proc. Eurospeech, Sep. 2001), a secondary HMM can also be used to extract robust ASR features. Here, we further investigate this novel approach towards using a full HMM2 as feature extractor, working in the spectral domain, and extracting robust formant-like features for a standard ASR system. HMM2 performs a nonlinear, state-dependent frequency warping, and it is shown that the resulting frequency segmentation actually contains particularly discriminant features. To improve the HMM2 system further, we complement the initial spectral energy vectors with frequency information. Finally, adding temporal information to the HMM2 feature vector yields further improvements. These conclusions are experimentally validated on the Numbers95 database, where word error rates of 15%, using only a 4-dimensional feature vector (3 formant-like parameters and one time index) were obtained.
Keywords :
feature extraction; hidden Markov models; speech recognition; ASR; HMM2; Numbers95 database; automatic speech recognition; feature extraction; frequency segmentation; frequency warping; hidden Markov model; secondary HMM; spectral domain; temporal information; Automatic speech recognition; Data mining; Error analysis; Feature extraction; Frequency; Hidden Markov models; Indexes; Robustness; Spatial databases; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Automatic Speech Recognition and Understanding, 2001. ASRU '01. IEEE Workshop on
Print_ISBN :
0-7803-7343-X
Type :
conf
DOI :
10.1109/ASRU.2001.1034590
Filename :
1034590
Link To Document :
بازگشت