DocumentCode :
2090373
Title :
LISTEN: a system for locating and tracking individual speakers
Author :
Collobert, M. ; Feraud, R. ; Le Tourneur, G. ; Bernier, O. ; Viallet, J.E. ; Mahieux, Y. ; Collobert, D.
Author_Institution :
CNET, Lannion, France
fYear :
1996
fDate :
14-16 Oct 1996
Firstpage :
283
Lastpage :
288
Abstract :
Both visual and acoustical informations provide effective means of telecommunication between persons. In this context, the face is the most important part of the person both visually and acoustically. We describe how the cooperation of image and audio processing allows to track a person´s face and to collect the audio information it produces. We present detection techniques of regions of interest (e.g. Moving regions of skin color), coupled with a neural network based face detector with a low false alarm rate, to locate and track faces. The system is connected to a nine microphone array adaptive beam forming which performs immediate beam forming. Visual and acoustical informations from the speaker face are thus obtained in real time
Keywords :
acoustic applications; face recognition; neural nets; optical tracking; acoustical information; audio information; audio processing; detection techniques; immediate beam forming; individual speaker location; individual speaker tracking; microphone array adaptive beam forming; neural network based face detector; skin color; Acoustic signal detection; Array signal processing; Biological systems; Detectors; Evolution (biology); Face detection; Loudspeakers; Microphone arrays; Neural networks; Skin;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Automatic Face and Gesture Recognition, 1996., Proceedings of the Second International Conference on
Conference_Location :
Killington, VT
Print_ISBN :
0-8186-7713-9
Type :
conf
DOI :
10.1109/AFGR.1996.557278
Filename :
557278
Link To Document :
بازگشت