• DocumentCode
    3244348
  • Title

    Transcribing Mandarin broadcast news

  • Author

    Chen, Langzhou ; Lame, Lori ; Gauvain, Jean-Luc

  • Author_Institution
    Spoken Language Process. Group, LIMSI-CNRS, Orsay, France
  • fYear
    2003
  • fDate
    30 Nov.-3 Dec. 2003
  • Firstpage
    99
  • Lastpage
    104
  • Abstract
    The paper describes improvements to the LIMSI broadcast news transcription system for the Mandarin language in preparation for the DARPA/NIST Rich Transcription 2003 (RT´03) evaluation. The transcription system has been substantially updated to deal with the varied acoustic and linguistic characteristics of the RT´03 test conditions. The major improvements come from the use of lightly supervised acoustic model training in order to benefit from unannotated audio data, the use of source specific language models, and MDI adaptation to tune the language models for sources with limited amounts of training data. The character error rate on the development data has been reduced from 34.5% with the baseline system to 22.6% with the evaluation system.
  • Keywords
    acoustics; error statistics; learning (artificial intelligence); linguistics; natural languages; speech recognition; DARPA/NIST Rich Transcription 2003; MDI adaptation; Mandarin language; acoustic characteristics; acoustic model training; broadcast news transcription system; character error rate; linguistic characteristics; source specific language models; unannotated audio data; Acoustic testing; Adaptation model; Broadcasting; Error analysis; NIST; Natural languages; Speech analysis; Speech recognition; System testing; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition and Understanding, 2003. ASRU '03. 2003 IEEE Workshop on
  • Print_ISBN
    0-7803-7980-2
  • Type

    conf

  • DOI
    10.1109/ASRU.2003.1318411
  • Filename
    1318411