DocumentCode
2703017
Title
Modeling Duration via Lattice Rescoring
Author
Jennequin, N. ; Gauvain, J. -L.
Author_Institution
Spoken Language Process. Group, LIMSI-CNRS, Orsay, France
Volume
4
fYear
2007
fDate
15-20 April 2007
Abstract
It is often acknowledged that HMMs do not properly model phone and word durations. In this paper phone and word duration models are used to improve the accuracy of state-of-the-art large vocabulary speech recognition systems. The duration information is integrated into the systems in a rescoring of word lattices that include phone-level segmentations. Experimental results are given for a conversational telephone speech (CTS) task in French and for the TC-Star EPPS transcription task in Spanish and English. An absolute word error rate reduction of about 0.5% is observed for the CTS task, and smaller but consistent gains are observed for the EPPS task in both languages.
Keywords
hidden Markov models; speech processing; speech recognition; English; French; HMM; Spanish; conversational telephone speech; large vocabulary speech recognition systems; lattice rescoring; phone models; phone-level segmentations; word duration models; word error rate reduction; Decoding; Error analysis; Hidden Markov models; Lattices; Natural languages; Speech recognition; Telephony; Topology; Training data; Vocabulary; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location
Honolulu, HI
ISSN
1520-6149
Print_ISBN
1-4244-0727-3
Type
conf
DOI
10.1109/ICASSP.2007.366994
Filename
4218182
Link To Document