• DocumentCode
    2704028
  • Title
    Multi-Pass Pronunciation Adaptation
  • Author
    Bodenstab, Nathan; Fanty, Mark
  • Author_Institution
    OGI, Oregon Health & Science University, Portland, OR
  • Volume
    4
  • fYear
    2007
  • fDate
    15-20 April 2007
  • Abstract
    A mapping between words and pronunciations (potential phonetic realizations) is a key component of speech recognition systems. Traditionally, this mapping has been encoded in a lexicon where each pronunciation is transcribed by a linguist or generated by a grapheme-to-phoneme algorithm. For large-vocabulary recognition systems, this process is highly susceptible to errors. We present an off-line, data-driven algorithm to correct suboptimal pronunciations using transcribed utterances. Unlike previous data-driven algorithms that struggle to balance acoustic representation and multi-speaker generalization, our multi-pass approach maximizes both criteria instead of compromising between the two. We demonstrate on an automated name dialing task that our multi-pass algorithm achieves a 70% error rate reduction when compared to a baseline grapheme-to-phoneme generated lexicon.
  • Keywords
    speech recognition; vocabulary; acoustic representation; automated name dialing task; error rate reduction; grapheme-to-phoneme algorithm; lexicon; multipass pronunciation adaptation; multispeaker generalization; off-line data driven algorithm; pronunciations; speech recognition systems; transcribed utterances; words; Acoustics; Automatic speech recognition; Bismuth; Error analysis; Loudspeakers; Memory; Natural languages; Robustness; Speech recognition; Vocabulary; adaptation; learning; pronunciation; speech
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Title
    Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
  • Conference_Location
    Honolulu, HI
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0727-3
  • Electronic_ISBN
    1520-6149
  • Type
    conf
  • DOI
    10.1109/ICASSP.2007.367207
  • Filename
    4218238