• DocumentCode
    3132823
  • Title

    A comparison-based approach to mispronunciation detection

  • Author

    Lee, Albert ; Glass, James

  • Author_Institution
    MIT Comput. Sci. & Artificial Intell. Lab., Cambridge, MA, USA
  • fYear
    2012
  • fDate
    2-5 Dec. 2012
  • Firstpage
    382
  • Lastpage
    387
  • Abstract
    The task of mispronunciation detection for language learning is typically accomplished via automatic speech recognition (ASR). Unfortunately, less than 2% of the world´s languages have an ASR capability, and the conventional process of creating an ASR system requires large quantities of expensive, annotated data. In this paper we report on our efforts to develop a comparison-based framework for detecting word-level mispronunciations in nonnative speech. Dynamic time warping (DTW) is carried out between a student´s (non-native speaker) utterance and a teacher´s (native speaker) utterance, and we focus on extracting word-level and phone-level features that describe the degree of mis-alignment in the warping path and the distance matrix. Experimental results on a Chinese University of Hong Kong (CUHK) nonnative corpus show that the proposed framework improves the relative performance on a mispronounced word detection task by nearly 50% compared to an approach that only considers DTW alignment scores.
  • Keywords
    computer aided instruction; feature extraction; matrix algebra; speech recognition; word processing; ASR; DTW; automatic speech recognition; degree of misalignment; distance matrix; dynamic time warping; language learning; mispronounced word detection; nonnative corpus; nonnative speech; phone level feature; warping path; word level feature; word-level mispronunciation; Feature extraction; Mel frequency cepstral coefficient; Speech; Support vector machines; System performance; Training; Vectors; dynamic time warping; language learning; mispronunciation detection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language Technology Workshop (SLT), 2012 IEEE
  • Conference_Location
    Miami, FL
  • Print_ISBN
    978-1-4673-5125-6
  • Electronic_ISBN
    978-1-4673-5124-9
  • Type

    conf

  • DOI
    10.1109/SLT.2012.6424254
  • Filename
    6424254