Title :
Latency in Speech Feature Analysis for Telepresence Event Coding
Author :
O'Gorman, Lawrence
Author_Institution :
Alcatel-Lucent Bell Labs., Murray Hill, NJ, USA
Abstract :
For videoconferencing, there are network bandwidth and screen real-estate constraints that limit the number of user channels. We propose an intermediate transmission mode that transmits only at "events", where these are detected by both audio and video changes from the short-term signal average. Our objective in this paper is to determine latency until the audio portion of a single telepresence channel stabilizes. It is this stable signal from which we detect events. We describe a recursive filter approach for feature determination and experiments on the Switchboard telephone call database. Results show latency to stable signal of up to 10 seconds. Although events can be detected much more quickly (<1 sec) if the signal is already stable, this latency time must be considered at the start of a conversation or at changes in the short-term average.
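The abstract does not give the filter equations, so the following is only a minimal sketch of the general idea, assuming a first-order recursive (exponential) average over per-frame feature values. The function names, the smoothing factor `alpha`, the fixed deviation `threshold`, and the settling `tolerance` are all hypothetical choices for illustration, not the author's method.

```python
def recursive_average(samples, alpha=0.1):
    """First-order recursive (IIR) filter: y[n] = (1 - alpha) * y[n-1] + alpha * x[n].

    Assumption: the short-term average is an exponential average seeded
    with the first sample.
    """
    averages = []
    y = samples[0]
    for x in samples:
        y = (1.0 - alpha) * y + alpha * x
        averages.append(y)
    return averages


def detect_events(samples, alpha=0.1, threshold=2.0):
    """Return indices where a sample deviates from the running average
    by more than a fixed threshold (hypothetical event criterion)."""
    mean = samples[0]
    events = []
    for n, x in enumerate(samples):
        # Compare against the average of the samples seen so far.
        if abs(x - mean) > threshold:
            events.append(n)
        # Then fold the new sample into the running average.
        mean = (1.0 - alpha) * mean + alpha * x
    return events


def settling_index(averages, target, tolerance):
    """Index of the first sample from which the filtered signal stays within
    `tolerance` of `target`; None if it never settles."""
    last_outside = -1
    for n, y in enumerate(averages):
        if abs(y - target) > tolerance:
            last_outside = n
    idx = last_outside + 1
    return idx if idx < len(averages) else None


# Usage: a step in the feature value takes many frames to settle,
# while a single outlier on an already stable signal is flagged at once.
step = [0.0] + [1.0] * 200
latency_frames = settling_index(recursive_average(step), target=1.0, tolerance=0.05)

stable_with_spike = [1.0] * 50 + [5.0] + [1.0] * 50
event_frames = detect_events(stable_with_spike)
```

In this sketch a smaller `alpha` gives a smoother average but a longer settling time, which mirrors the trade-off the abstract describes between latency to a stable signal and fast event detection once the signal is stable.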
Keywords :
speech processing; teleconferencing; video coding; latency time; network bandwidth; recursive filter; screen real-estate constraint; speech feature analysis; telepresence event coding; transmission mode; videoconferencing; Equations; IIR filters; Measurement uncertainty; Noise; Noise measurement; Speech; Time measurement; anomaly detection; event detection
Conference_Title :
Pattern Recognition (ICPR), 2010 20th International Conference on
Conference_Location :
Istanbul
Print_ISBN :
978-1-4244-7542-1
DOI :
10.1109/ICPR.2010.1084