DocumentCode :
3132823
Title :
A comparison-based approach to mispronunciation detection
Author :
Lee, Albert ; Glass, James
Author_Institution :
MIT Comput. Sci. & Artificial Intell. Lab., Cambridge, MA, USA
fYear :
2012
fDate :
2-5 Dec. 2012
Firstpage :
382
Lastpage :
387
Abstract :
The task of mispronunciation detection for language learning is typically accomplished via automatic speech recognition (ASR). Unfortunately, less than 2% of the world´s languages have an ASR capability, and the conventional process of creating an ASR system requires large quantities of expensive, annotated data. In this paper we report on our efforts to develop a comparison-based framework for detecting word-level mispronunciations in nonnative speech. Dynamic time warping (DTW) is carried out between a student´s (non-native speaker) utterance and a teacher´s (native speaker) utterance, and we focus on extracting word-level and phone-level features that describe the degree of mis-alignment in the warping path and the distance matrix. Experimental results on a Chinese University of Hong Kong (CUHK) nonnative corpus show that the proposed framework improves the relative performance on a mispronounced word detection task by nearly 50% compared to an approach that only considers DTW alignment scores.
Keywords :
computer aided instruction; feature extraction; matrix algebra; speech recognition; word processing; ASR; DTW; automatic speech recognition; degree of misalignment; distance matrix; dynamic time warping; language learning; mispronounced word detection; nonnative corpus; nonnative speech; phone level feature; warping path; word level feature; word-level mispronunciation; Feature extraction; Mel frequency cepstral coefficient; Speech; Support vector machines; System performance; Training; Vectors; dynamic time warping; language learning; mispronunciation detection;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Spoken Language Technology Workshop (SLT), 2012 IEEE
Conference_Location :
Miami, FL
Print_ISBN :
978-1-4673-5125-6
Electronic_ISBN :
978-1-4673-5124-9
Type :
conf
DOI :
10.1109/SLT.2012.6424254
Filename :
6424254
Link To Document :
بازگشت