مرکز منطقه ای اطلاع رساني علوم و فناوري - Pseudo 2-dimensional hidden Markov models in speech recognition

DocumentCode :

2280766

Title :

Pseudo 2-dimensional hidden Markov models in speech recognition

Author :

Werner, Stefen ; Rigoll, Gerhard

Author_Institution :

Dept. of Comput. Sci., Gerhard Mercator Univ., Duisburg, Germany

fYear :

2001

fDate :

2001

Firstpage :

441

Lastpage :

444

Abstract :

In this paper, the usage of pseudo 2-dimensional hidden Markov models for speech recognition is discussed. This image processing method should better model the time-frequency structure in speech signals. The method calculates the emission probability of a standard HMM by embedded HMM for each state. If a temporal sequence of spectral vectors is imagined as a spectrogram, this leads to a 2-dimensional warping of the spectrogram. This additional warping of the frequency axis could be useful for speaker-independent recognition and can be considered to be similar to a vocal tract normalization. The effects of this paradigm are investigated in this paper using the TI-Digits database.

Keywords :

feature extraction; hidden Markov models; probability; spectral analysis; speech recognition; time-frequency analysis; 2-dimensional warping; TI-Digits database; embedded HMM; emission probability; hidden Markov models; image processing method; pseudo 2-dimensional HMM; speaker-independent recognition; spectral vectors; spectrogram; speech recognition; speech signals; temporal sequence; time-frequency structure; vocal tract normalization; Computer science; Databases; Feature extraction; Frequency; Hidden Markov models; Image processing; Signal processing; Spectrogram; Speech processing; Speech recognition;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Automatic Speech Recognition and Understanding, 2001. ASRU '01. IEEE Workshop on

Print_ISBN :

0-7803-7343-X

Type :

conf

DOI :

10.1109/ASRU.2001.1034679

Filename :

1034679

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2280766