• DocumentCode
    3531166
  • Title

    An efficient mispronounciation detction method using GLDS-SVM and formant enhanced features

  • Author

    Li, Hongyan ; Liang, JiaEn ; Wang, ShiJin ; Xu, Bo

  • Author_Institution
    Digital Content Technol. Res. Center, Chinese Acad. of Sci., Beijing
  • fYear
    2009
  • fDate
    19-24 April 2009
  • Firstpage
    4845
  • Lastpage
    4848
  • Abstract
    Mispronunciation detection is an important component in computer assisted language learning (CALL) system. In this work, we introduce an efficient GLDS-SVM based detection method, which is successfully used in language and speaker identification systems, and combine it with traditional methods. The main ideas include: extended MFCC features with normalized formant trajectory information, and then propose a novel multi-model strategy for model training to make full use of samples and solve the problem of data unbalance, finally combine GLDS-SVM method with UBM-GMM system to further improve the performance. Experiments show that GLDS-SVM is highly efficient than traditional RBF-SVM, and the fused system can achieve a significant relative improvement of 17.5% in EER reduction, compared with the baseline UBM-GMM system.
  • Keywords
    Gaussian processes; computer aided instruction; support vector machines; CALL; GLDS-SVM; MFCC; UBM-GMM; computer assisted language learning; generalized linear discriminant sequence-support vector machine; mel-frequency cepstral coefficient; mispronunciation detection method; normalized formant trajectory information; universal background model-Gaussian mixture model; Acoustic signal detection; Automatic testing; Automation; Feedback; Machine learning; Mel frequency cepstral coefficient; Natural languages; Speech recognition; Support vector machine classification; Support vector machines; Computer Assisted Language Learning; Generalized Linear Discriminant Sequence; Mispronunciation Detection; Support Vector Machine; System Fusion;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
  • Conference_Location
    Taipei
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-2353-8
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2009.4960716
  • Filename
    4960716