• DocumentCode
    1653264
  • Title

    Effective modeling of acoustic confusions for Mandarin CALL system

  • Author

    Ge, Fengpei ; Pan, Fuping ; Liu, Changliang ; Bin Dong ; Yan, Yonghong

  • Author_Institution
    ThinkIT Lab., Chinese Acad. of Sci., Beijing
  • fYear
    2008
  • Firstpage
    663
  • Lastpage
    666
  • Abstract
    Acoustic confusions degrade the accuracy of pronunciation assessment severely in computer assisted language learning (CALL) systems. This paper presents our recent study on effective modeling of the acoustic confusions. We change the traditional Mandarin syllable structure, which is composed of initial and final, to a novel phoneme structure. Several phoneme splitting strategies are investigated, and the question list used for building and merging decision tree is studied. Experiments show that the optimal phoneme splitting strategy outperforms the traditional initial-final structure in our CALL system, with relative 11.05% ASER improvement for nasal finals. This idea may be extended to improve the performance of automatic speech recognition (ASR).
  • Keywords
    computer aided instruction; decision trees; linguistics; speech processing; speech recognition; Mandarin CALL system; Mandarin syllable structure; acoustic confusion modeling; automatic speech recognition; computer assisted language learning system; decision tree merging; phoneme splitting strategy; pronunciation assessment; Acoustic measurements; Automatic speech recognition; Buildings; Decision trees; Decoding; Degradation; Hidden Markov models; Merging; Natural languages; Viterbi algorithm;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing, 2008. ICSP 2008. 9th International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4244-2178-7
  • Electronic_ISBN
    978-1-4244-2179-4
  • Type

    conf

  • DOI
    10.1109/ICOSP.2008.4697218
  • Filename
    4697218