DocumentCode :
3283091
Title :
Multi-stream segmentation of meetings
Author :
Dielmann, Alfred ; Renals, Steve
Author_Institution :
Centre for Speech Technol. Res., Edinburgh Univ., UK
fYear :
2004
fDate :
29 Sept.-1 Oct. 2004
Firstpage :
167
Lastpage :
170
Abstract :
This paper investigates the automatic segmentation of meetings into a sequence of group actions or phases. Our work is based on a corpus of multiparty meetings collected in a meeting room instrumented with video cameras, lapel microphones and a microphone array. We have extracted a set of feature streams, in this case from the audio data, based on speaker turns, prosody and a transcript of what was spoken. We have related these signals to the higher-level semantic categories via a multistream statistical model based on dynamic Bayesian networks (DBNs). We report on a set of experiments in which different DBN architectures are compared, together with the different feature streams. The resultant system has an action error rate of 9%.
Keywords :
audio signal processing; belief networks; feature extraction; image segmentation; image sequences; microphone arrays; multimedia communication; statistical analysis; video cameras; video signal processing; audio extraction; dynamic Bayesian network; meeting segmentation; microphone array; multistream segmentation; video camera; Bayesian methods; Cameras; Data mining; Dictionaries; Error analysis; Instruments; Microphone arrays; Paper technology; Speech; Streaming media;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Multimedia Signal Processing, 2004 IEEE 6th Workshop on
Print_ISBN :
0-7803-8578-0
Type :
conf
DOI :
10.1109/MMSP.2004.1436458
Filename :
1436458