Title :
Forward optimal measures for automatic mispronunciation detection
Author :
Liu, Changliang ; Pan, Fuping ; Ge, Fengpei ; Dong, Bin ; Yan, Yonghong
Author_Institution :
Inst. of Acoust., Chinese Acad. of Sci., Beijing, China
Date :
Nov. 29 2010-Dec. 3 2010
Abstract :
Pronunciation measure computation is a vital part of a Computer Assisted Pronunciation Training (CAPT) system. This paper studies pronunciation measures based on two popular measures: log posterior probability (LPP) and Goodness of Pronunciation (GOP). A modified GOP, AGOP, is proposed; when computing the denominator of GOP, it directly uses the segmentation information from forced alignment instead of a free phone recognizer (FPR), which avoids the effect of FPR inaccuracy. Context-dependent acoustic models (AMs) are also investigated for mispronunciation detection, and the tri-phone AM is found to perform better in mispronunciation detection of continuous speech. This paper also proposes a fast pronunciation measure algorithm, FAGOP, which approximates the denominator of AGOP by using maximization instead of summation and applies the Viterbi algorithm with effective pruning strategies to reduce the computational complexity. It achieves much better efficiency while barely impairing detection precision.
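The difference between a summed and a maximized denominator can be illustrated with a minimal sketch. It is not the paper's implementation: the abstract does not give the exact AGOP or FAGOP formulas, so the frame normalization, the function names (`log_sum_exp`, `segment_score`) and the toy log-likelihood values below are all assumptions made for illustration. The sketch assumes that forced alignment has already fixed the segment boundaries, and that each candidate phone has been scored on that same segment.

```python
import math


def log_sum_exp(values):
    """Numerically stable log(sum(exp(v))) over a list of log-likelihoods."""
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values))


def segment_score(log_likes, target, num_frames, use_max=False):
    """GOP-style score for one force-aligned segment (hypothetical helper).

    log_likes  : dict mapping each candidate phone to log p(O | phone),
                 all computed on the same aligned segment (assumption).
    target     : the phone the speaker was expected to say.
    num_frames : segment length, used here as an assumed normalizer.
    use_max    : False sums over all phones in the denominator,
                 True approximates that sum by its largest term.
    """
    scores = list(log_likes.values())
    denominator = max(scores) if use_max else log_sum_exp(scores)
    return (log_likes[target] - denominator) / num_frames


# Toy values, invented for illustration only.
log_likes = {"a": -42.0, "o": -45.0, "e": -50.0}

summed = segment_score(log_likes, "a", num_frames=10)
maxed = segment_score(log_likes, "a", num_frames=10, use_max=True)
wrong = segment_score(log_likes, "e", num_frames=10, use_max=True)
```

In this toy case the expected phone `a` is also the best-scoring one, so the maximized variant scores it at exactly zero and the summed variant slightly below zero, while a poorly matching target such as `e` receives a clearly negative score. Because the largest term dominates the sum, the two variants differ only slightly here, which is consistent with the abstract's claim that the approximation barely affects detection precision.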
Keywords :
maximum likelihood estimation; probability; speech recognition; Viterbi algorithm; automatic mispronunciation detection; computation perplexity; computer assisted pronunciation training; free phone recognizer; goodness of pronunciation; log posterior probability; Acoustic measurements; Computational modeling; Context; Hidden Markov models; Mathematical model; Probability; Speech;
Conference_Title :
Chinese Spoken Language Processing (ISCSLP), 2010 7th International Symposium on
Conference_Location :
Tainan
Print_ISBN :
978-1-4244-6244-5
DOI :
10.1109/ISCSLP.2010.5684844