DocumentCode
3348269
Title
Dynamic Bayesian networks for meeting structuring
Author
Dielmann, Alfred ; Renals, Steve
Author_Institution
Centre for Speech Technol. Res., Edinburgh Univ., UK
Volume
5
fYear
2004
fDate
17-21 May 2004
Abstract
The paper is about the automatic structuring of multiparty meetings using audio information. We have used a corpus of 53 meetings, recorded using a microphone array and lapel microphones for each participant. The task was to segment meetings into a sequence of meeting actions, or phases. We have adopted a statistical approach using dynamic Bayesian networks (DBNs). Two DBN architectures were investigated: a two-level hidden Markov model (HMM) in which the acoustic observations were concatenated; and a multistream DBN in which two separate observation sequences were modelled. We have also explored the use of counter variables to constrain the number of action transitions. Experimental results indicate that the DBN architectures are an improvement over a simple baseline HMM, with the multistream DBN with counter constraints producing an action error rate of 6%.
Keywords
audio signal processing; belief networks; hidden Markov models; pattern recognition; HMM; automatic meeting structuring; dynamic Bayesian networks; hidden Markov model; lapel microphones; microphone array; Audio recording; Bayesian methods; Concatenated codes; Counting circuits; Error analysis; Hidden Markov models; Humans; Microphone arrays; Speech; Streaming media;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-8484-9
Type
conf
DOI
10.1109/ICASSP.2004.1327189
Filename
1327189
Link To Document