DocumentCode
2181253
Title
Exploring nuisance attribute projection and score normalization for GLDS-SVM based automatic mispronunciation detection method
Author
Li, Hongyan ; Huang, Shen ; Wang, ShiJin ; Liang, JiaEn ; Xu, Bo
Author_Institution
Inst. of Autom., Chinese Acad. of Sci., Beijing, China
fYear
2011
fDate
22-27 May 2011
Firstpage
5668
Lastpage
5671
Abstract
In the task of mispronunciation detection, the cross-speaker degradation and some other confusing nuisances are the challenging problems demanding prompt solution. In this paper, we will attempt to remove the non-pronunciation variations in the GLDS-SVM expansion space by using nuisance attribute projection strategy, in order to increase the separating capacity between different phoneme instances. Moreover, different kinds of score normalization methods with softmax, posterior probability vector (PPV), Z-norm and T-norm are comparatively discussed. The experiments on three kinds of speech corpora demonstrate the effectiveness of the above methods, and the performance improvement is not very significant, but sustainable.
Keywords
probability; speaker recognition; support vector machines; GLDS-SVM expansion; cross-speaker degradation; generalized linear discriminant sequence kernel; mispronunciation detection; nuisance attribute projection; posterior probability vector; score normalization; speech corpora; Computational modeling; Kernel; Speaker recognition; Speech; Support vector machines; Testing; Training; Automatic mispronunciation detection; generalized linear discriminant sequence; nuisance attribute projection; score normalization; support vector machines;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location
Prague
ISSN
1520-6149
Print_ISBN
978-1-4577-0538-0
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2011.5947646
Filename
5947646
Link To Document