• DocumentCode
    417136
  • Title

    Joint decoding for phoneme-grapheme continuous speech recognition

  • Author

    Doss, Mathew Magimai ; Bengio, Samy ; Bourlard, Hervé

  • Author_Institution
    Dalle Molle Inst. for Artificial Intelligence, Martigny, Switzerland
  • Volume
    1
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    Standard ASR systems typically use phonemes as the subword units. Preliminary studies have shown that the performance of ASR systems could be improved by using graphemes as additional subword units. We investigate such a system where the word models are defined in terms of two different subword units, i.e., phoneme and grapheme. During training, models for both the subword units are trained, and then, during recognition, either both or just one subword unit is used. We have studied this system for a continuous speech recognition task in American English. Our studies show that grapheme information used along with phoneme information improves the performance of ASR.
  • Keywords
    decoding; learning (artificial intelligence); speech recognition; ASR systems; American English; automatic speech recognition; joint decoding; phoneme-grapheme continuous speech recognition; subword units; training; Artificial intelligence; Automatic speech recognition; Decision trees; Decoding; Hidden Markov models; Natural languages; Speech recognition; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2004.1325951
  • Filename
    1325951