DocumentCode :
3427615
Title :
Audiovisual discrimination between laughter and speech
Author :
Petridis, Stavros ; Pantic, Maja
Author_Institution :
Imperial Coll. London, London
fYear :
2008
fDate :
March 31 2008-April 4 2008
Firstpage :
5117
Lastpage :
5120
Abstract :
Past research on automatic laughter detection has focused mainly on audio-based detection. Here we present an audiovisual approach to distinguishing laughter from speech and we show that integrating the information from audio and video leads to an improved reliability of audiovisual approach in comparison to single-modal approaches. We also investigated the level at which audiovisual information should be fused for the best performance. When tested on 96 audiovisual sequences depicting spontaneously displayed (as opposed to posed) laughter and speech episodes, the proposed audiovisual feature-level approach achieved a 86.9% recall rate with 76.7% precision.
Keywords :
audio-visual systems; speech recognition; video signal processing; audiovisual data processing; audiovisual discrimination; laughter detection; nonlinguistic information processing; single-modal approaches; Ambient intelligence; Audio recording; Detectors; Face detection; Hidden Markov models; Natural languages; Speech recognition; Support vector machine classification; Support vector machines; Video recording; Audiovisual data processing; data fusion; laughter detection; nonlinguistic information processing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
ISSN :
1520-6149
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2008.4518810
Filename :
4518810
Link To Document :
بازگشت