DocumentCode :
417166
Title :
Prior knowledge guided MEL based model selection and adaptation for nonnative speech recognition
Author :
He, Xiaodong ; Zhao, Yunxin
Author_Institution :
Dept. of Comput. Eng. & Comput. Sci., Missouri Univ., USA
Volume :
1
fYear :
2004
fDate :
17-21 May 2004
Abstract :
An improved method of model complexity selection for nonnative speech recognition is proposed by using maximum a posteriori estimation of bias distributions. An algorithm is described for estimating the hyper-parameters of the prior distributions, and an automatic accent detection algorithm is also proposed for integration with dynamic model selection and adaptation. Experiments were performed on the WSJ1 task with American English speech, British accent speech, and Mandarin Chinese accent speech. Results show that the use of prior knowledge of accents enabled reliable estimation of bias distributions in the case of a very small amount of adaptation speech, or without adaptation speech. Recognition results show that the new approach is superior to the previous MEL (maximum expected likelihood) method, especially when the adaptation data are extremely limited.
Keywords :
maximum likelihood estimation; natural languages; speech recognition; American English; British accent; Mandarin Chinese accent; automatic accent detection; bias distributions; maximum a posteriori estimation; maximum expected likelihood; model selection; nonnative speech recognition; prior knowledge; speaker adaptation; Adaptation model; Computer science; Degradation; Detection algorithms; Distributed computing; Helium; Loudspeakers; Maximum a posteriori estimation; Maximum likelihood linear regression; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-8484-9
Type :
conf
DOI :
10.1109/ICASSP.2004.1325991
Filename :
1325991
Link To Document :
بازگشت