• DocumentCode
    2703689
  • Title

    Modelling Pronunciation Variation using Multi-Path HMMS for Syllables

  • Author

    Hamalainen, A. ; Bosch, L. ; Boves, L.

  • Author_Institution
    Centre for Language & Speech Technol., Radboud Univ. Nijmegen, Netherlands
  • Volume
    4
  • fYear
    2007
  • fDate
    15-20 April 2007
  • Abstract
    Recent research suggests that it is more appropriate to model pronunciation variation with syllable-length acoustic models than with triphones. Due to the large number of factors contributing to pronunciation variation at the syllable level, the creation of multi-path model topologies appears necessary. In this paper, we construct multi-path models using phonetic knowledge to initialise the parallel paths, and a data-driven solution for their reestimation. When applied to 94 frequent syllables in a Dutch read speech recognition task, the approach leads to improved recognition performance when compared with a much more complex triphone recogniser. A detailed analysis of the pronunciation variation captured by the parallel paths pinpoints the deficiencies of the approach, and provides insights into how these may be overcome.
  • Keywords
    hidden Markov models; speech recognition; Dutch read speech recognition task; modelling pronunciation variation; multi-path HMM; phonetic knowledge; syllable-length acoustic models; triphone recogniser; Appropriate technology; Automatic speech recognition; Data mining; Displays; Hidden Markov models; Libraries; Natural languages; Speech recognition; Topology; Training data; Speech recognition; hidden Markov models;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
  • Conference_Location
    Honolulu, HI
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0727-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2007.367029
  • Filename
    4218217