DocumentCode
2064844
Title
Decision Fusion for Improving Mispronunciation Detection Using Language Transfer Knowledge and Phoneme-Dependent Pronunciation Scoring
Author
Lo, W.K. ; Harrison, Alissa M. ; Meng, Helen ; Wang, Lan
Author_Institution
Chinese Univ. of Hong Kong, Shenzhen, China
fYear
2008
fDate
16-19 Dec. 2008
Firstpage
1
Lastpage
4
Abstract
Application of linguistic knowledge of language transfer to automatic speech recognition (ASR) technology can enhance mispronunciation detection performance in computer-aided pronunciation training (CAPT). This is achieved by pinpointing salient pronunciation errors made by second language learners. In this work, we propose to apply decision fusion for further improvement in mispronunciation detection performance. Detection decision from the linguistically-motivated detection, which applies language transfer knowledge, is used as the basis. Back off to posterior probability based pronunciation scoring with phoneme-dependent thresholds is employed when the basis is "less-reliable". Fusion can help combat problems such as incomplete coverage of linguistic knowledge as well as the imperfection of acoustic models in ASR. Our fusion strategy can maintain the diagnosis capability of the linguistically-motivated approach while achieve a major boost in detection performance. Experimental results show that decision fusion can achieve relative improvement in mispronunciation detection of up to 30% reduction in total number of decision errors.
Keywords
computer based training; linguistics; probability; sensor fusion; speech recognition; CAPT; acoustic model; automatic speech recognition; computer-aided pronunciation training; decision fusion; language transfer knowledge; linguistically-motivated detection; mispronunciation detection; phoneme-dependent pronunciation scoring; Acoustic signal detection; Application software; Automatic speech recognition; Computer errors; Feedback; Hidden Markov models; Loudspeakers; Natural languages; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Chinese Spoken Language Processing, 2008. ISCSLP '08. 6th International Symposium on
Conference_Location
Kunming
Print_ISBN
978-1-4244-2942-4
Electronic_ISBN
978-1-4244-2943-1
Type
conf
DOI
10.1109/CHINSL.2008.ECP.18
Filename
4730272
Link To Document