DocumentCode
179527
Title
Subspace Gaussian mixture model for computer-assisted language learning
Author
Rong Tong ; Boon Pang Lim ; Chen, Nancy F. ; Bin Ma ; Haizhou Li
Author_Institution
Inst. for Infocomm Res., Singapore, Singapore
fYear
2014
fDate
4-9 May 2014
Firstpage
5347
Lastpage
5351
Abstract
In computer-assisted language learning (CALL), speech data from non-native speakers are usually insufficient for acoustic modeling. Subspace Gaussian Mixture Models (SGMM) have been effective in training automatic speech recognition (ASR) systems with limited amounts of training data. Therefore, in this work, we propose to use SGMM to improve the fluency assessment performance. In particular, the contributions of this work are: (i) The proposed SGMM acoustic model trained with native data outperforms the MMI-GMM/HMM baseline by 25% relative, (ii) when incorporating a small amount of non-native training data, the SGMM acoustic model further improves the performance of fluency assessment by 47% relative.
Keywords
Gaussian processes; acoustic signal processing; computer aided instruction; linguistics; mixture models; natural language processing; speech recognition; ASR systems; CALL; SGMM acoustic modeling; automatic speech recognition; computer-assisted language learning; fluency assessment performance; nonnative speakers; nonnative training data; speech data; subspace Gaussian mixture models; Acoustics; Correlation; Data models; Hidden Markov models; Speech; Speech recognition; Training; Automatic Speech Recognition (ASR); Computer Assisted Language Learning (CALL); Fluency assessment; Goodness Of Pronunciation (GOP); Subspace Gaussian Mixture Model (SGMM);
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location
Florence
Type
conf
DOI
10.1109/ICASSP.2014.6854624
Filename
6854624
Link To Document