DocumentCode
2016819
Title
Capturing L2 segmental mispronunciations with joint-sequence models in Computer-Aided Pronunciation Training (CAPT)
Author
Qian, Xiaojun ; Meng, Helen ; Soong, Frank
Author_Institution
MoE-Microsoft Key Lab. of Human-Centric Comput. & Interface Technol., CUHK, Hong Kong, China
fYear
2010
fDate
Nov. 29 2010-Dec. 3 2010
Firstpage
84
Lastpage
88
Abstract
In this study, we present an extension to our previous efforts on automatically detecting text-dependent segmental mispronunciations by Cantonese (L1) learners of American English (L2), through modeling the L2 production. The problem of segmental mispronunciation modeling is addressed by joint-sequence models. Specifically, a grapheme-to-phoneme model is built to convert the prompted words to their corresponding possible mispronunciations, instead of the previous characterization of phonological processes based on a transfer from the canonical phonetic transcription. Experiments show that the approach can capture the mispronunciations better than the knowledge based and data-driven phonological rules.
Keywords
computer based training; knowledge based systems; natural language processing; CAPT; L2 segmental mispronunciations capturing; american English; canonical phonetic transcription; computer aided pronunciation training; data driven phonological rules; grapheme-to-phoneme model; joint sequence models; phonological processes; text dependent segmental mispronunciations; Adaptation model; Hidden Markov models; Joints; Knowledge based systems; Speech; Testing; Training;
fLanguage
English
Publisher
ieee
Conference_Titel
Chinese Spoken Language Processing (ISCSLP), 2010 7th International Symposium on
Conference_Location
Tainan
Print_ISBN
978-1-4244-6244-5
Type
conf
DOI
10.1109/ISCSLP.2010.5684845
Filename
5684845
Link To Document