• DocumentCode
    2176516
  • Title

    Pronunciation variants generation using SMT-inspired approaches

  • Author

    Karanasou, Panagiota ; Lamel, Lori

  • Author_Institution
    Spoken Language Process. Group, LIMSI-CNRS, Orsay, France
  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    4908
  • Lastpage
    4911
  • Abstract
    Enriching a pronunciation dictionary with phonological variation is a challenging task, not yet solved despite several decades of research, in particular for speech-to-text transcription of real world data where it is important to cover different pronunciation variants. This paper proposes two alternative methods, inspired by machine translation, to derive pronunciation variants from an initial lexicon with limited variations. In the first case, an n-best pronunciation list is extracted directly from a machine translation tool, used as a grapheme-to-phoneme (g2p) converter. The second is a novel method based on a pivot approach, previously used for the paraphrase extraction task, and here applied as a post-processing step to the g2p converter. Some preliminary speech recognition experiments with the automatically generated pronunciation variants are reported using Quaero development data.
  • Keywords
    language translation; speech recognition; Quaero development data; SMT-inspired approach; g2p converter; grapheme-to-phoneme converter; machine translation; phonological variation; pronunciation variants generation; speech recognition; speech-to-text transcription; Context; Dictionaries; Error analysis; Hidden Markov models; Maximum likelihood decoding; Speech recognition; Training; English; SMT; g2p conversion; pivot paraphrasing; pronunciation variants;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2011.5947456
  • Filename
    5947456