• DocumentCode
    2703017
  • Title

    Modeling Duration via Lattice Rescoring

  • Author

    Jennequin, N. ; Gauvain, J. -L.

  • Author_Institution
    Spoken Language Process. Group, LIMSI-CNRS, Orsay, France
  • Volume
    4
  • fYear
    2007
  • fDate
    15-20 April 2007
  • Abstract
    It is often acknowledged that HMMs do not properly model phone and word durations. In this paper phone and word duration models are used to improve the accuracy of state-of-the-art large vocabulary speech recognition systems. The duration information is integrated into the systems in a rescoring of word lattices that include phone-level segmentations. Experimental results are given for a conversational telephone speech (CTS) task in French and for the TC-Star EPPS transcription task in Spanish and English. An absolute word error rate reduction of about 0.5% is observed for the CTS task, and smaller but consistent gains are observed for the EPPS task in both languages.
  • Keywords
    hidden Markov models; speech processing; speech recognition; English; French; HMM; Spanish; conversational telephone speech; large vocabulary speech recognition systems; lattice rescoring; phone models; phone-level segmentations; word duration models; word error rate reduction; Decoding; Error analysis; Hidden Markov models; Lattices; Natural languages; Speech recognition; Telephony; Topology; Training data; Vocabulary; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
  • Conference_Location
    Honolulu, HI
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0727-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2007.366994
  • Filename
    4218182