Title :
Low-latency online speaker tracking on the AMI Corpus of meeting conversations
Author :
Zamalloa, Maider ; Rodriguez-Fuentes, Luis Javier ; Bordel, German ; Penagarikano, Mike ; Uribe, J.P.
Author_Institution :
Dept. of Electr. & Electron., GTTS, Univ. of the Basque Country, San Sebastian, Spain
Abstract :
Ambient Inteligence aims to create smart spaces providing services in a transparent and non-intrusive fashion, so context awareness and user adaptation are key issues. Speech can be exploited for user adaptation in such scenarios by continuously tracking speaker identity. However, most speaker tracking approaches require processing the full audio recording before determining speaker turns, which makes them unsuitable for online processing and low-latency decision-making. In this work a low-latency speaker tracking system is presented, which deals with continuous audio streams and outputs decisions at one-second intervals, by scoring fixed-length audio segments with a set of target speaker models. A smoothing technique is explored, based on the scores of past segments, which increases the robustness of tracking decisions to local variability. Experimental results are reported on the AMI Corpus of meeting conversations, revealing the effectiveness of the proposed approach when compared to an offline speaker tracking approach developed for reference.
Keywords :
audio recording; audio signal processing; audio streaming; decision making; smoothing methods; speaker recognition; tracking; AMI corpus; audio recording; context awareness; decision making; fixed length audio segment; low latency online speaker tracking; meeting conversation; smoothing technique; Ambient intelligence; Audio recording; Cepstral analysis; Decision making; Loudspeakers; Robustness; Space technology; Speech; Streaming media; Target tracking; AMI Corpus; Ambient Intelligence; Low-latency; Speaker Recognition; Speaker Tracking;
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2010.5495089