Title :
Speaker detection and tracking at mobile robot platform
Author :
Yoon, Sang Min ; Kee, Seok Cheol
Author_Institution :
Human Comput. Interaction Lab., Samsung Adv. Inst. of Technol., Gyounggi-Do, South Korea
Abstract :
In this paper, we present a novel method to detect and track multiple people for an audio-visual speaker localization system using an uncalibrated camera and an 8 microphone array. We propose an object oriented speaker detection and tracking algorithm that uses a combination of skin color and human upper body part appearance information. Candidate regions of the input image are extracted from the color transform that is modeled by a 2D Gaussian function in a normalized RGB color space. We classify the humans and other objects which have a similar skin color region from the geometric structure and Hausdorff distance from the human shape. We decide the location of the speaker from audio and visual information and the robot tracks the speaker using CAMShift. The experimental results show that the proposed algorithm initializes and tracks the speaker automatically in various environments such as an image with skin color noise and complex background.
Keywords :
feature extraction; image classification; image colour analysis; mobile robots; object recognition; robot vision; speech recognition; target tracking; 2D Gaussian function; Hausdorff distance; color transform extracted regions; complex background image; computer vision; human classification; human robot interface; human upper body part appearance information; image matching; microphone array; mobile robot platform; multiple speaker tracking; normalized RGB color space; object classification; object oriented speaker detection; skin color; skin color noise image; sound source localization; uncalibrated camera; Biological system modeling; Cameras; Data mining; Humans; Microphone arrays; Mobile robots; Object detection; Object oriented modeling; Robot vision systems; Skin;
Conference_Titel :
Intelligent Signal Processing and Communication Systems, 2004. ISPACS 2004. Proceedings of 2004 International Symposium on
Print_ISBN :
0-7803-8639-6
DOI :
10.1109/ISPACS.2004.1439126