Title :
Audio packet loss over IP and speech recognition
Author :
Mayorga, Pedro ; Besacier, Laurent ; Lamy, Richard ; Serignat, Jean-Francois
Author_Institution :
GEOD Team-CLIPS Lab., UMR CNRS, Grenoble, France
fDate :
30 Nov.-3 Dec. 2003
Abstract :
This paper deals with the effects of packet loss on speech recognition over IP connections. The performance of our continuous French speech recognition system is here evaluated for different transmission scenarios. A packet loss simulation model is first proposed in order to simulate different channel degradation conditions. The packet loss problem is also investigated in real transmissions through IP. Because packet loss impact may be different according to the speech coder used to transmit data, different transmission conditions with different audio codecs are also investigated. Several reconstruction strategies to recover lost information are then proposed, and tested. Another solution for dialog applications is also suggested, where the relative weight of the language and acoustic model is changed according to the packet loss rate. The results show that the speech recognition performance can be augmented by the solutions here presented.
Keywords :
Internet telephony; speech coding; speech recognition; IP transmission; VoIP; acoustic model weighting; audio codecs; channel degradation conditions; dialog applications; information reconstruction; language model weighting; lost information recovery strategies; packet loss rate; speech coders; speech recognition; voice over IP audio packet loss; word error rate; Acoustic testing; Databases; Degradation; Internet; Laboratories; Performance loss; Propagation losses; Speech analysis; Speech codecs; Speech recognition;
Conference_Titel :
Automatic Speech Recognition and Understanding, 2003. ASRU '03. 2003 IEEE Workshop on
Print_ISBN :
0-7803-7980-2
DOI :
10.1109/ASRU.2003.1318509