• DocumentCode
    730808
  • Title

    Integrated pronunciation learning for automatic speech recognition using probabilistic lexical modeling

  • Author

    Rasipuram, Ramya ; Razavi, Marzieh ; Magimai-Doss, Mathew

  • Author_Institution
    Idiap Res. Inst., Martigny, Switzerland
  • fYear
    2015
  • fDate
    19-24 April 2015
  • Firstpage
    5176
  • Lastpage
    5180
  • Abstract
    Standard automatic speech recognition (ASR) systems use phoneme-based pronunciation lexicon prepared by linguistic experts. When the hand crafted pronunciations fail to cover the vocabulary of a new domain, a grapheme-to-phoneme (G2P) converter is used to extract pronunciations for new words and then a phoneme based ASR system is trained. G2P converters are typically trained only on the existing lexicons. In this paper, we propose a grapheme based ASR approach in the framework of probabilistic lexical modeling that integrates pronunciation learning as a stage in ASR system training, and exploits both acoustic and lexical resources (not necessarily from the domain or language of interest). The proposed approach is evaluated on four lexical resource constrained ASR tasks and compared with the conventional two stage approach where G2P training is followed by ASR system development.
  • Keywords
    learning (artificial intelligence); probability; speech recognition; ASR system training; G2P converters; automatic speech recognition; crafted pronunciations; grapheme-to-phoneme; integrated pronunciation learning; linguistic experts; phoneme based pronunciation lexicon; probabilistic lexical modeling; Acoustics; Hidden Markov models; Probabilistic logic; Speech; Speech recognition; Training; Vocabulary; Probabilistic lexical modeling; grapheme subwords; grapheme-tophoneme conversion; phoneme subwords; pronunciation lexicon;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
  • Conference_Location
    South Brisbane, QLD
  • Type

    conf

  • DOI
    10.1109/ICASSP.2015.7178958
  • Filename
    7178958