مرکز منطقه ای اطلاع رساني علوم و فناوري - Audiovisual data fusion for successive speakers tracking

DocumentCode :

3669588

Title :

Audiovisual data fusion for successive speakers tracking

Author :

Quentin Labourey;Olivier Aycard;Denis Pellerin;Michele Rombaut

Author_Institution :

LIG, Grenoble, France

Volume :

fYear :

2014

Firstpage :

696

Lastpage :

701

Abstract :

In this paper, a human speaker tracking method on audio and video data is presented. It is applied to conversation tracking with a robot. Audiovisual data fusion is performed in a two-steps process. Detection is performed independently on each modality: face detection based on skin color on video data and sound source localization based on the time delay of arrival on audio data. The results of those detection processes are then fused thanks to an adaptation of bayesian filter to detect the speaker. The robot is able to detect the face of the talking person and to detect a new speaker in a conversation.

Keywords :

"Robots","Face","Sensors","Visualization","Skin","Bayes methods","Data integration"

Publisher :

ieee

Conference_Titel :

Computer Vision Theory and Applications (VISAPP), 2014 International Conference on

Type :

conf

Filename :

7294876

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3669588