• DocumentCode
    2481371
  • Title

    Modeling Syllable-Based Pronunciation Variation for Accented Mandarin Speech Recognition

  • Author

    Zhang, Shilei ; Shi, Qin ; Qin, Yong

  • Author_Institution
    IBM Res.-China, Beijing, China
  • fYear
    2010
  • fDate
    23-26 Aug. 2010
  • Firstpage
    1606
  • Lastpage
    1609
  • Abstract
    Pronunciation variation is a natural and inevitable phenomenon in an accented Mandarin speech recognition application. In this paper, we integrate knowledge-based and data-driven approaches together for syllable-based pronunciation variation modeling to improve the performance of Mandarin speech recognition system for speakers with Southern accent. First, we generate the syllable-based pronunciation variation rules of Southern accent observed from the training corpus by Chinese linguistic expert. Second, dictionary augmentation with multiple pronunciation variants and pronunciation probability derived from forced alignment statistics of training data. The acoustic models will be retrained based on the new expansion dictionary. Finally, pronunciation variation adaptation will be performed to further fit the data on the decoding stage by taking distribution of variation rules clusters of testing set into account. The experimental results show that the proposed method provides a flexible framework to improve the recognition performance for accented speech effectively.
  • Keywords
    computational linguistics; natural language processing; probability; speech recognition; Chinese linguistic expert; Southern accent; accented Mandarin speech recognition; acoustic model; alignment statistics; data-driven approach; expansion dictionary; knowledge-based approach; pronunciation probability; pronunciation variation adaptation; syllable-based pronunciation variation modeling; syllable-based pronunciation variation rules; training data; Adaptation model; Artificial intelligence; Dictionaries; Histograms; Speech recognition; Testing; Training; accented Mandarin; expansion dictionary; pronunciation variation; syllable-based rules;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition (ICPR), 2010 20th International Conference on
  • Conference_Location
    Istanbul
  • ISSN
    1051-4651
  • Print_ISBN
    978-1-4244-7542-1
  • Type

    conf

  • DOI
    10.1109/ICPR.2010.397
  • Filename
    5595975