DocumentCode :
638981
Title :
An audio-visual approach to learning salient behaviors in couples´ problem solving discussions
Author :
Gibson, J. ; Bo Xiao ; Georgiou, Panayiotis G. ; Narayanan, Shrikanth
Author_Institution :
Signal Anal. & Interpretation Lab., Univ. of Southern California, Los Angeles, CA, USA
fYear :
2013
fDate :
15-19 July 2013
Firstpage :
1
Lastpage :
4
Abstract :
We present a method for characterizing salient behavioral events from audio-visual data of dyadic human interactions. This behavioral signal processing work is aimed at supporting observational analysis of domain experts such as psychologists and clinicians. We extract prosodic and spectral speech features as well as visual motion vector features on overlapping windows from a multimodal corpus. We then apply a technique called multiple instance learning to detect salient audio and visual instances for predicting human expert annotated behavior ratings. We demonstrate the performance gains achieved through multimodal fusion in characterizing complex behavior patterns of interest such as blame and acceptance in recordings of couples´ problem solving discussions during marital therapy.
Keywords :
audio signal processing; behavioural sciences computing; feature extraction; image fusion; image motion analysis; learning (artificial intelligence); spectral analysis; speech processing; Couples therapy corpus; audio-visual approach; behavioral signal processing work; dyadic human interactions; human expert annotated behavior rating prediction; marital therapy; multimodal corpus; multimodal fusion; multiple instance learning; observational analysis; overlapping windows; performance gains; problem solving discussions; prosodic speech feature extraction; salient audio instance detection; salient behavior learning; salient behavioral events; spectral speech feature extraction; visual instance detection; visual motion vector feature extraction; Accuracy; Feature extraction; Hidden Markov models; Medical treatment; Speech; Vectors; Visualization; audio-visual signal processing; behavioral signal processing; multiple instance learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo Workshops (ICMEW), 2013 IEEE International Conference on
Conference_Location :
San Jose, CA
Type :
conf
DOI :
10.1109/ICMEW.2013.6618248
Filename :
6618248
Link To Document :
بازگشت