DocumentCode
2703689
Title
Modelling Pronunciation Variation using Multi-Path HMMS for Syllables
Author
Hamalainen, A. ; Bosch, L. ; Boves, L.
Author_Institution
Centre for Language & Speech Technol., Radboud Univ. Nijmegen, Netherlands
Volume
4
fYear
2007
fDate
15-20 April 2007
Abstract
Recent research suggests that it is more appropriate to model pronunciation variation with syllable-length acoustic models than with triphones. Due to the large number of factors contributing to pronunciation variation at the syllable level, the creation of multi-path model topologies appears necessary. In this paper, we construct multi-path models using phonetic knowledge to initialise the parallel paths, and a data-driven solution for their reestimation. When applied to 94 frequent syllables in a Dutch read speech recognition task, the approach leads to improved recognition performance when compared with a much more complex triphone recogniser. A detailed analysis of the pronunciation variation captured by the parallel paths pinpoints the deficiencies of the approach, and provides insights into how these may be overcome.
Keywords
hidden Markov models; speech recognition; Dutch read speech recognition task; modelling pronunciation variation; multi-path HMM; phonetic knowledge; syllable-length acoustic models; triphone recogniser; Appropriate technology; Automatic speech recognition; Data mining; Displays; Hidden Markov models; Libraries; Natural languages; Speech recognition; Topology; Training data; Speech recognition; hidden Markov models;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location
Honolulu, HI
ISSN
1520-6149
Print_ISBN
1-4244-0727-3
Type
conf
DOI
10.1109/ICASSP.2007.367029
Filename
4218217
Link To Document