DocumentCode
290123
Title
Phoneme recognition in continuous speech using large inhomogeneous hidden Markov models
Author
Sitaram, R.N.V. ; Sreenivas, T.V.
Author_Institution
Dept. of Electr. Commun. Eng., Indian Inst. of Sci., Bangalore, India
Volume
i
fYear
1994
fDate
19-22 Apr 1994
Abstract
We present a novel scheme for phoneme recognition in continuous speech using inhomogeneous hidden Markov models (IHMMs). IHMMs can capture the temporal structure of phonemes and inter-phonemic temporal relationships effectively, with their duration dependent state transition probabilities. A two stage IHMM is proposed to capture the variabilities in speech effectively for phoneme recognition. The first stage models the acoustic and durational variabilities of all distinct sub-phonemic segments and the second stage models the acoustic and durational variability of the whole phoneme. In an experimental evaluation of the new scheme for recognizing a subset of alphabets comprising of the most confusing set of phonemes, spoken randomly and continuously, a phoneme recognition accuracy of 83% is observed
Keywords
acoustic signal processing; hidden Markov models; probability; speech recognition; IHMM; acoustic variabilities; alphabets; continuous speech; durational variabilities; inhomogeneous hidden Markov models; phoneme recognition accuracy; state transition probabilities; sub-phonemic segments; temporal structure; Fluctuations; Hidden Markov models; Probability distribution; Speech analysis; Speech recognition; Vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on
Conference_Location
Adelaide, SA
ISSN
1520-6149
Print_ISBN
0-7803-1775-0
Type
conf
DOI
10.1109/ICASSP.1994.389360
Filename
389360
Link To Document