• DocumentCode
    1694953
  • Title

    Audiovisual synthesis of exaggerated speech for corrective feedback in computer-assisted pronunciation training

  • Author

    Junhong Zhao ; Hua Yuan ; Wai-Kim Leung ; Meng, Hsiang-Yun ; Jia Liu ; Shanhong Xia

  • Author_Institution
    State Key Lab. on Transducing Technol., IECAS, Beijing, China
  • fYear
    2013
  • Firstpage
    8218
  • Lastpage
    8222
  • Abstract
    In second language learning, learners' unawareness of the differences between correct and incorrect pronunciations is one of the largest obstacles to correcting mispronunciations. To make corrective feedback easier to perceive and discriminate, this paper presents a novel method for generating it, namely exaggerated feedback. To produce the exaggeration effect, the neutral audio and visual speech are both exaggerated and then re-synthesized synchronously using audiovisual synthesis technology. Audio speech exaggeration is realized by adjusting the acoustic features related to the duration, pitch and energy of the speech according to different phoneme conditions. Visual speech exaggeration is realized by increasing the range of articulatory movement and slowing down the movement around the key actions. The results show that the proposed methods can effectively generate a bimodal exaggeration effect for feedback provision and make the feedback more distinctive to perceive.
  • Keywords
    audio-visual systems; speech recognition; speech synthesis; acoustic features; articulatory movement; audio speech exaggeration; audiovisual synthesis; bimodal exaggeration effect; computer-assisted pronunciation training; corrective feedback generation; exaggerated feedback; exaggerated speech; mispronunciation correction; neutral audio speech; second language learning; visual speech; Animation; Indexes; Lips; Speech; Synthesizers; Training; Visualization; Computer-assisted pronunciation training; exaggerated feedback; visual-speech synthesis;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
  • Conference_Location
    Vancouver, BC
  • ISSN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2013.6639267
  • Filename
    6639267