DocumentCode
2511630
Title
Detecting Group Turn Patterns in Conversations Using Audio-Video Change Scale-Space
Author
Krishnan, RaviKiran ; Sarkar, Sudeep
Author_Institution
Dept. of Comput. Sci. & Eng., Univ. of South Florida, Tampa, FL, USA
fYear
2010
fDate
23-26 Aug. 2010
Firstpage
137
Lastpage
140
Abstract
Automatic analysis of conversations is important for extracting high-level descriptions of meetings. In this work, as an alternative to linguistic approaches, we develop a novel, purely bottom-up representation, constructed from both audio and video signals that help us characterize and build a rich description of the content at multiple temporal scales. We consider the evolution of the detected change, using Bayesian Information Criterion (BIC) at multiple temporal scales to build an audio-visual change scale space. Peaks detected in this representation, yields group-turn based conversational changes at different temporal scales. Conversation overlaps, changes and their inferred models offer an intermediate-level description of meeting videos that can be useful in summarization and indexing of meetings. Results on NIST meeting room dataset showed a true positive rate of 88%.
Keywords
Bayes methods; audio signal processing; video signal processing; Bayesian information criterion; audio-video change scale-space; automatic conversation analysis; bottom-up representation; group turn pattern detection; meeting indexing; meeting summarization; meeting videos; multiple temporal scales; Bayesian methods; Diamond-like carbon; Meetings; Mel frequency cepstral coefficient; Principal component analysis; Psychology; Speech;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition (ICPR), 2010 20th International Conference on
Conference_Location
Istanbul
ISSN
1051-4651
Print_ISBN
978-1-4244-7542-1
Type
conf
DOI
10.1109/ICPR.2010.42
Filename
5597617
Link To Document