DocumentCode :
3304164
Title :
Multi-view video based tracking and audio-visual identification of persons in a human-computer-interaction scenario
Author :
Meudt, Sascha ; Glodek, Michael ; Schels, Martin ; Schwenker, Friedhelm
Author_Institution :
Inst. of Neural Inf. Process., Univ. of Ulm, Ulm, Germany
fYear :
2013
fDate :
13-15 June 2013
Firstpage :
116
Lastpage :
121
Abstract :
User identification and tracking are basic tasks in any human-computer interaction (HCI) scenario. For these tasks we propose a multi-view approach utilizing multi-camera systems and audio processing systems. Face detectors and face recognizers are based on orientation histogram and eigenface techniques, and Mel Frequency Cepstral Coefficients (MFCC) are applied for speaker identification. To achieve robust user identification and localization, spatio-temporal classifier fusion methods have been integrated into the overall classifier system; support vector machines (SVM) and k nearest neighbor (kNN) models are used as base classifiers. A general office environment with up to six persons was the test bed for data collection and numerical evaluation.
Keywords :
audio user interfaces; audio-visual systems; cameras; cepstral analysis; face recognition; human computer interaction; numerical analysis; object tracking; pattern classification; spatiotemporal phenomena; speaker recognition; support vector machines; video signal processing; HCI scenario; MFCC; SVM; audio processing systems; audio-visual person identification; data collection; eigenface techniques; face detectors; face recognizers; human-computer-interaction scenario; k nearest neighbor models; kNN models; mel frequency cepstral coefficients; multicamera systems; multiview video based person tracking; numerical evaluation; orientation histogram; robust user identification spatio-temporal classifier fusion methods; robust user localization spatio-temporal classifier fusion methods; speaker identification; support vector machines; user tracking; Cameras; Face; Histograms; Image color analysis; Mel frequency cepstral coefficient; Support vector machines; Vectors; Human Computer Interaction; Human Position Estimation; Person Identification; Speaker Identification; Video Tracking;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Cybernetics (CYBCONF), 2013 IEEE International Conference on
Conference_Location :
Lausanne
Type :
conf
DOI :
10.1109/CYBConf.2013.6617454
Filename :
6617454