• DocumentCode
    2065870
  • Title

    Evaluation and Analysis of Minimum Phone Error Training and its Modified Versions for Large Vocabulary Mandarin Speech Recognition

  • Author

    Cheng, Yung-Jen ; Lin, Che-kuang ; Lee, Lin-shan

  • Author_Institution
    Grad. Inst. of Commun. Eng., Nat. Taiwan Univ., Taipei, Taiwan
  • fYear
    2008
  • fDate
    16-19 Dec. 2008
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    This paper reports a detailed study of minimum phone error (MPE), minimum phone frame error (MPFE), and a physical-state-level version of minimum Bayes risk (sMBR) training, as well as several modified versions of them, for transcription of large-vocabulary Mandarin broadcast news. We found that the results differ considerably from those observed previously for English and Arabic broadcast news tasks [1]; in particular, the trends differ depending on which performance measure (word or character accuracy) is used. This distinction matters for Chinese, for which character accuracy is usually the more important measure, whereas word accuracy is commonly used for other languages. The modifications tested here include accounting for variable phone length and applying penalties to erroneous frames. They significantly improved character accuracy in our experiments.
  • Keywords
    Bayes methods; natural language processing; speech recognition; Chinese language; large vocabulary Mandarin speech recognition; minimum Bayes risk; minimum phone frame error; phone error training; Broadcasting; Lattices; Maximum likelihood estimation; Natural languages; Particle measurements; Risk analysis; Speech analysis; Speech recognition; Testing; Vocabulary;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Chinese Spoken Language Processing, 2008. ISCSLP '08. 6th International Symposium on
  • Conference_Location
    Kunming
  • Print_ISBN
    978-1-4244-2942-4
  • Electronic_ISBN
    978-1-4244-2943-1
  • Type

    conf

  • DOI
    10.1109/CHINSL.2008.ECP.51
  • Filename
    4730305