• DocumentCode
    2280401
  • Title

    Automatic accent identification using Gaussian mixture models

  • Author

    Chen, Tao ; Huang, Chao ; Chang, Eric ; Wang, Jingchun

  • Author_Institution
    Microsoft Research China
  • fYear
    2001
  • fDate
    13-13 Dec. 2001
  • Firstpage
    343
  • Lastpage
    346
  • Abstract
    It is well known that speaker variability caused by accent is an important factor io speech recognition. Some major accents in China are so different as to make this problem very severe. We propose a Gaussian mixture model (GMM) based Mandarin accent identitication method. In this method a number of GMMs are trained to identify the most likely accent given test utterances. The identified accent type can be used to select an accent-dependent model for speech recognition. A multi-accent Mandarin corpus was developed for the task, including 4 typical accents in China with 1,440 speakers (l,200 for training, 240 for testing). We explore experimentally the effect of the number of components in GMM on identification performance. We also investigate how many utterances per speaker are sufficient to reliably recognize his/her accent. Finally, we show the correlations among accents and provide some discussion.
  • Keywords
    Chaos; Ferroelectric films; Hidden Markov models; Loudspeakers; Natural languages; Nonvolatile memory; Random access memory; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition and Understanding, 2001. ASRU '01. IEEE Workshop on
  • Conference_Location
    Madonna di Campiglio, Italy
  • Print_ISBN
    0-7803-7343-X
  • Type

    conf

  • DOI
    10.1109/ASRU.2001.1034657
  • Filename
    1034657