DocumentCode :
2506493
Title :
Latency in Speech Feature Analysis for Telepresence Event Coding
Author :
O'Gorman, Lawrence
Author_Institution :
Alcatel-Lucent Bell Labs, Murray Hill, NJ, USA
fYear :
2010
fDate :
23-26 Aug. 2010
Firstpage :
4464
Lastpage :
4467
Abstract :
For videoconferencing, there are network bandwidth and screen real-estate constraints that limit the number of user channels. We propose an intermediate transmission mode that transmits only at "events", where these are detected by both audio and video changes from the short-term signal average. Our objective in this paper is to determine latency until the audio portion of a single telepresence channel stabilizes. It is this stable signal from which we detect events. We describe a recursive filter approach for feature determination and experiments on the Switchboard telephone call database. Results show latency to stable signal of up to 10 seconds. Although events can be detected much more quickly (<1 sec) if the signal is already stable, this latency time must be considered at the start of a conversation or at changes in the short-term average.
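The abstract does not specify the filter itself, so the following is only a minimal sketch of the general idea of detecting events as deviations from a recursively filtered short-term average. It assumes a first-order recursive (IIR) running mean and variance; the function name `detect_events` and the parameters `alpha`, `threshold`, and `warmup` are hypothetical and are not taken from the paper.

```python
def detect_events(samples, alpha=0.1, threshold=3.0, warmup=10):
    """Flag indices where a sample deviates from the short-term average.

    Assumed illustration only: a first-order recursive filter tracks the
    running mean and variance; a sample counts as an "event" when it lies
    more than `threshold` standard deviations from the running mean.
    Samples before `warmup` are never flagged, because the running
    estimates have not yet stabilized.
    """
    mean = samples[0]
    var = 0.0
    events = []
    for i, x in enumerate(samples[1:], start=1):
        dev = x - mean
        std = var ** 0.5
        if i >= warmup and std > 0 and abs(dev) > threshold * std:
            events.append(i)
        # Recursive updates: each new estimate depends only on the
        # previous estimate and the current sample.
        mean += alpha * dev
        var = (1 - alpha) * (var + alpha * dev * dev)
    return events


# Hypothetical feature values: a steady pattern, one outlier, then steady again.
feature = [1.0, 1.1, 0.9] * 10 + [5.0] + [1.0, 1.1, 0.9] * 3
print(detect_events(feature))
```

In a filter of this kind the estimates settle over roughly 1/`alpha` samples, which gives an intuition for why a latency to a stable signal must be allowed for at the start of a conversation or after the short-term average changes.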
Keywords :
speech processing; teleconferencing; video coding; latency time; network bandwidth; recursive filter; screen real-estate constraint; speech feature analysis; telepresence event coding; transmission mode; videoconferencing; Equations; IIR filters; Measurement uncertainty; Noise; Noise measurement; Speech; Time measurement; anomaly detection; event detection
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Pattern Recognition (ICPR), 2010 20th International Conference on
Conference_Location :
Istanbul
ISSN :
1051-4651
Print_ISBN :
978-1-4244-7542-1
Type :
conf
DOI :
10.1109/ICPR.2010.1084
Filename :
5597378