Title :
Automatic phoneme recognition with Segmental Hidden Markov Models
Author :
Baghdasaryan, Areg G. ; Beex, A.A.
Author_Institution :
Bradley Dept. of Electr. & Comput. Eng., Virginia Tech, Blacksburg, VA, USA
Abstract :
A new technique is presented for the joint recognition and segmentation task formulated for a speaker independent continuous phoneme recognition and segmentation system. We investigate a strictly probabilistic approach for simultaneous phoneme sequence segmentation and recognition. The implemented automatic phoneme recognition system integrates phoneme length statistics as well as phoneme transition statistics into the Segmental Hidden Markov Model (SHMM). A variation of the Viterbi Search algorithm is employed for estimating the most likely sequence of phonetic symbols as well as their corresponding segment boundaries. The Segmental HMM topology essentially models a phonetic symbol string with a double layer Hidden Markov Model (HMM), with each phonetic symbol in the Segmental HMM modeled with a left-to-right HMM. Our approach lays the groundwork for further expansion of Segmental HMM design to context dependent continuous phoneme recognition systems.
Keywords :
hidden Markov models; probability; speech recognition; SHMM; automatic phoneme recognition; joint recognition; joint segmenatation; phoneme sequence recognition; phoneme sequence segmentation; probabilistic approach; segmental hidden Markov models; speaker independent continuous phoneme recognition; viterbi search algorithm; Arrays; Computational modeling; Hidden Markov models; Indexes; Speech; Speech recognition; Viterbi algorithm; Phoneme Recognition; Segmental Hidden Markov Model; Viterbi Search;
Conference_Titel :
Signals, Systems and Computers (ASILOMAR), 2011 Conference Record of the Forty Fifth Asilomar Conference on
Conference_Location :
Pacific Grove, CA
Print_ISBN :
978-1-4673-0321-7
DOI :
10.1109/ACSSC.2011.6190066