DocumentCode
2280401
Title
Automatic accent identification using Gaussian mixture models
Author
Chen, Tao ; Huang, Chao ; Chang, Eric ; Wang, Jingchun
Author_Institution
Microsoft Research China
fYear
2001
fDate
13-13 Dec. 2001
Firstpage
343
Lastpage
346
Abstract
It is well known that speaker variability caused by accent is an important factor io speech recognition. Some major accents in China are so different as to make this problem very severe. We propose a Gaussian mixture model (GMM) based Mandarin accent identitication method. In this method a number of GMMs are trained to identify the most likely accent given test utterances. The identified accent type can be used to select an accent-dependent model for speech recognition. A multi-accent Mandarin corpus was developed for the task, including 4 typical accents in China with 1,440 speakers (l,200 for training, 240 for testing). We explore experimentally the effect of the number of components in GMM on identification performance. We also investigate how many utterances per speaker are sufficient to reliably recognize his/her accent. Finally, we show the correlations among accents and provide some discussion.
Keywords
Chaos; Ferroelectric films; Hidden Markov models; Loudspeakers; Natural languages; Nonvolatile memory; Random access memory; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Automatic Speech Recognition and Understanding, 2001. ASRU '01. IEEE Workshop on
Conference_Location
Madonna di Campiglio, Italy
Print_ISBN
0-7803-7343-X
Type
conf
DOI
10.1109/ASRU.2001.1034657
Filename
1034657
Link To Document