Title :
An HMM-based formalism for automatic subword unit derivation and pronunciation generation
Author :
Razavi, Marzieh ; Magimai-Doss, Mathew
Author_Institution :
Idiap Res. Inst., Martigny, Switzerland
Abstract :
We propose a novel hidden Markov model (HMM) formalism for automatic derivation of subword units and pronunciation generation using only transcribed speech data. In this approach, the subword units are derived from the clustered context-dependent units in a grapheme based system using maximum-likelihood criterion. The subword unit based pronunciations are then learned in the framework of Kullback-Leibler divergence based HMM. The automatic speech recognition (ASR) experiments on WSJ0 English corpus show that the approach leads to 12.7% relative reduction in word error rate compared to grapheme-based system. Our approach can be beneficial in reducing the need for expert knowledge in development of ASR as well as text-to-speech systems.
Keywords :
hidden Markov models; maximum likelihood estimation; speech recognition; ASR experiments; HMM formalism; Kullback-Leibler divergence based HMM; WSJ0 English corpus; automatic speech recognition experiments; clustered context-dependent units; grapheme based system; hidden Markov model formalism; maximum-likelihood criterion; pronunciation generation; subword units automatic derivation; text-to-speech systems; transcribed speech data; word error rate; Acoustics; Hidden Markov models; Mathematical model; Probabilistic logic; Speech; Speech recognition; Training; Kullback-Leibler divergence based hidden Markov model; automatic subword unit derivation; hidden Markov model; pronunciation generation;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
DOI :
10.1109/ICASSP.2015.7178850