Title :
Audiovisual discrimination between laughter and speech
Author :
Petridis, Stavros ; Pantic, Maja
Author_Institution :
Imperial Coll. London, London
fDate :
March 31 2008-April 4 2008
Abstract :
Past research on automatic laughter detection has focused mainly on audio-based detection. Here we present an audiovisual approach to distinguishing laughter from speech and we show that integrating the information from audio and video leads to an improved reliability of audiovisual approach in comparison to single-modal approaches. We also investigated the level at which audiovisual information should be fused for the best performance. When tested on 96 audiovisual sequences depicting spontaneously displayed (as opposed to posed) laughter and speech episodes, the proposed audiovisual feature-level approach achieved a 86.9% recall rate with 76.7% precision.
Keywords :
audio-visual systems; speech recognition; video signal processing; audiovisual data processing; audiovisual discrimination; laughter detection; nonlinguistic information processing; single-modal approaches; Ambient intelligence; Audio recording; Detectors; Face detection; Hidden Markov models; Natural languages; Speech recognition; Support vector machine classification; Support vector machines; Video recording; Audiovisual data processing; data fusion; laughter detection; nonlinguistic information processing;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2008.4518810