• DocumentCode
    2016819
  • Title

    Capturing L2 segmental mispronunciations with joint-sequence models in Computer-Aided Pronunciation Training (CAPT)

  • Author

    Qian, Xiaojun ; Meng, Helen ; Soong, Frank

  • Author_Institution
    MoE-Microsoft Key Lab. of Human-Centric Comput. & Interface Technol., CUHK, Hong Kong, China
  • fYear
    2010
  • fDate
    Nov. 29 2010-Dec. 3 2010
  • Firstpage
    84
  • Lastpage
    88
  • Abstract
    In this study, we present an extension to our previous efforts on automatically detecting text-dependent segmental mispronunciations by Cantonese (L1) learners of American English (L2), through modeling the L2 production. The problem of segmental mispronunciation modeling is addressed by joint-sequence models. Specifically, a grapheme-to-phoneme model is built to convert the prompted words to their corresponding possible mispronunciations, instead of the previous characterization of phonological processes based on a transfer from the canonical phonetic transcription. Experiments show that the approach can capture the mispronunciations better than the knowledge based and data-driven phonological rules.
  • Keywords
    computer based training; knowledge based systems; natural language processing; CAPT; L2 segmental mispronunciations capturing; american English; canonical phonetic transcription; computer aided pronunciation training; data driven phonological rules; grapheme-to-phoneme model; joint sequence models; phonological processes; text dependent segmental mispronunciations; Adaptation model; Hidden Markov models; Joints; Knowledge based systems; Speech; Testing; Training;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Chinese Spoken Language Processing (ISCSLP), 2010 7th International Symposium on
  • Conference_Location
    Tainan
  • Print_ISBN
    978-1-4244-6244-5
  • Type

    conf

  • DOI
    10.1109/ISCSLP.2010.5684845
  • Filename
    5684845