• DocumentCode
    149039
  • Title

    Efficient rule scoring for improved grapheme-based lexicons

  • Author

    Hartmann, W. ; Lamel, Lori ; Gauvain, Jean-Luc

  • Author_Institution
    Spoken Language Process. Group, LIMSI, Orsay, France
  • fYear
    2014
  • fDate
    1-5 Sept. 2014
  • Firstpage
    1477
  • Lastpage
    1481
  • Abstract
    For many languages, an expert-defined phonetic lexicon may not exist. One popular alternative is the use of a grapheme-based lexicon. However, there may be a significant difference between the orthography and the pronunciation of the language. In our previous work, we proposed a statistical machine translation based approach to improving grapheme-based pronunciations. Without knowledge of true target pronunciations, a phrase table was created where each individual rule improved the likelihood of the training data when applied. The approach improved recognition accuracy, but required significant computational cost. In this work, we propose an improvement that increases the speed of the process by more than 80 times without decreasing recognition accuracy.
  • Keywords
    language translation; speech recognition; statistical analysis; automatic speech recognition; expert-defined phonetic lexicon; grapheme based pronunciations; improved grapheme-based lexicons; language pronunciation; orthography; rule scoring; statistical machine translation based approach; Abstracts; Acoustics; Hidden Markov models; automatic speech recognition; grapheme-based speech recognition; pronunciation learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference (EUSIPCO), 2014 Proceedings of the 22nd European
  • Conference_Location
    Lisbon
  • Type

    conf

  • Filename
    6952535