Title :
Automatic assessment of English learner pronunciation using discriminative classifiers
Author :
Nicolao, Mauro ; Beeston, Amy V. ; Hain, Thomas
Author_Institution :
Dept. of Comput. Sci., Univ. of Sheffield, Sheffield, UK
Abstract :
This paper presents a novel system for automatic assessment of pronunciation quality of English learner speech, based on deep neural network (DNN) features and phoneme specific discriminative classifiers. DNNs trained on a large corpus of native and non-native learner speech are used to extract phoneme posterior probabilities. A part of the corpus includes per phone teacher annotations, which allows training of two Gaussian Mixture Models (GMM), representing correct pronunciations and typical error patterns. The likelihood ratio is then obtained for each observed phone. Several models were evaluated on a large corpus of English-learning students, with a variety of skill levels, and aged 13 upwards. The cross-correlation of the best system and average human annotator reference scores is 0.72, with miss and false alarm rate around 19%. Automatic assessment is 81.6% correct with a high degree of confidence. The new approach significantly outperforms spectral distance based baseline systems.
Keywords :
Gaussian processes; mixture models; neural nets; speech recognition; DNN features; GMM; Gaussian mixture models; automatic assessment; deep neural network; english learner pronunciation; english learner speech; english-learning students; human annotator reference scores; likelihood ratio; nonnative learner speech; per phone teacher annotations; phoneme posterior probability; phoneme specific discriminative classifiers; pronunciation quality; Acoustics; Feature extraction; Neural networks; Nickel; Regression tree analysis; Speech; Training; Computer-Assisted Language Learning; DNN-GMM; Pronunciation assessment; binary classifier;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
DOI :
10.1109/ICASSP.2015.7178993