• DocumentCode
    2064924
  • Title

    Improving Automatic Evaluation of Mandarin Pronunciation with Speaker Adaptive Training (SAT) and MLLR Speaker Adaption

  • Author

    Huang, Chao ; Zhang, Feng ; Soong, Frank K.

  • Author_Institution
    iFlytek Speech Lab., Univ. of Sci. & Technol. of China, China
  • fYear
    2008
  • fDate
    16-19 Dec. 2008
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Automatic pronunciation evaluation (APE) can be implemented with a speech recognition model trained by standard, "golden" speakers. The pronunciation accuracy is then measured with the Goodness of Pronunciation (GOP) as reported in our earlier work [1]. In this paper, we investigate two main strategies for improving the evaluation: speaker adaptive training (SAT) for reducing the speaker-specific characteristics in model training and MLLR-based speaker adaptation in evaluation for reducing mismatch between the trained model and a testing speaker. Overall, the proposed strategies improve the correlation between evaluations made by APE and human experts from 0.69 to 0.76, approaching the upper bound value of 0.78 among human expert evaluators. Additionally, APE also shows a consistency of 0.93 better than the consistency of 0.83 among human experts.
  • Keywords
    speaker recognition; speech recognition; MLLR speaker adaption; Mandarin pronunciation; automatic pronunciation evaluation; golden speakers; goodness of pronunciation; speaker adaptive training; speech recognition model; Acoustic measurements; Asia; Chaos; Databases; Humans; Loudspeakers; Maximum likelihood linear regression; Speech analysis; Speech recognition; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Chinese Spoken Language Processing, 2008. ISCSLP '08. 6th International Symposium on
  • Conference_Location
    Kunming
  • Print_ISBN
    978-1-4244-2942-4
  • Electronic_ISBN
    978-1-4244-2943-1
  • Type

    conf

  • DOI
    10.1109/CHINSL.2008.ECP.21
  • Filename
    4730275