DocumentCode :
2280401
Title :
Automatic accent identification using Gaussian mixture models
Author :
Chen, Tao ; Huang, Chao ; Chang, Eric ; Wang, Jingchun
Author_Institution :
Microsoft Research China
fYear :
2001
fDate :
13-13 Dec. 2001
Firstpage :
343
Lastpage :
346
Abstract :
It is well known that speaker variability caused by accent is an important factor io speech recognition. Some major accents in China are so different as to make this problem very severe. We propose a Gaussian mixture model (GMM) based Mandarin accent identitication method. In this method a number of GMMs are trained to identify the most likely accent given test utterances. The identified accent type can be used to select an accent-dependent model for speech recognition. A multi-accent Mandarin corpus was developed for the task, including 4 typical accents in China with 1,440 speakers (l,200 for training, 240 for testing). We explore experimentally the effect of the number of components in GMM on identification performance. We also investigate how many utterances per speaker are sufficient to reliably recognize his/her accent. Finally, we show the correlations among accents and provide some discussion.
Keywords :
Chaos; Ferroelectric films; Hidden Markov models; Loudspeakers; Natural languages; Nonvolatile memory; Random access memory; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Automatic Speech Recognition and Understanding, 2001. ASRU '01. IEEE Workshop on
Conference_Location :
Madonna di Campiglio, Italy
Print_ISBN :
0-7803-7343-X
Type :
conf
DOI :
10.1109/ASRU.2001.1034657
Filename :
1034657
Link To Document :
بازگشت