Title :
Detecting Group Turn Patterns in Conversations Using Audio-Video Change Scale-Space
Author :
Krishnan, RaviKiran ; Sarkar, Sudeep
Author_Institution :
Dept. of Comput. Sci. & Eng., Univ. of South Florida, Tampa, FL, USA
Abstract :
Automatic analysis of conversations is important for extracting high-level descriptions of meetings. In this work, as an alternative to linguistic approaches, we develop a novel, purely bottom-up representation, constructed from both audio and video signals that help us characterize and build a rich description of the content at multiple temporal scales. We consider the evolution of the detected change, using Bayesian Information Criterion (BIC) at multiple temporal scales to build an audio-visual change scale space. Peaks detected in this representation, yields group-turn based conversational changes at different temporal scales. Conversation overlaps, changes and their inferred models offer an intermediate-level description of meeting videos that can be useful in summarization and indexing of meetings. Results on NIST meeting room dataset showed a true positive rate of 88%.
Keywords :
Bayes methods; audio signal processing; video signal processing; Bayesian information criterion; audio-video change scale-space; automatic conversation analysis; bottom-up representation; group turn pattern detection; meeting indexing; meeting summarization; meeting videos; multiple temporal scales; Bayesian methods; Diamond-like carbon; Meetings; Mel frequency cepstral coefficient; Principal component analysis; Psychology; Speech;
Conference_Titel :
Pattern Recognition (ICPR), 2010 20th International Conference on
Conference_Location :
Istanbul
Print_ISBN :
978-1-4244-7542-1
DOI :
10.1109/ICPR.2010.42