Regional Information Center for Science and Technology - Information fusion and decision cascading for audio-visual speaker recognition based on time-varying stream reliability prediction

DocumentCode :

1870437

Title :

Information fusion and decision cascading for audio-visual speaker recognition based on time-varying stream reliability prediction

Author :

Chaudhari, Upendra V. ; Ramaswamy, Ganesh N. ; Potamianos, Gerasimos ; Neti, Chalapathy

Author_Institution :

IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA

Volume :

fYear :

2003

fDate :

6-9 July 2003

Abstract :

We examine techniques for multi-modal biometric information fusion for verification and identification of speakers, where the reliability of each data stream, either audio or video, is modeled with parameters that are time-varying and depend on the context created by its local behavior. The complementary nature and the time-dependent relative reliability of audio and video data are studied in the context of verification and identification, on data collected during a user's interaction with an automated system. Significantly, this data is not corrupted artificially. Particular focus is directed to verification and its ability to refine identification decisions by indicating a level of confidence in the system decisions. With time-dependent fusion, the results show more striking effects for verification than for identification.

Keywords :

biometrics (access control); reliability; sensor fusion; speaker recognition; audio-visual speaker recognition; data stream; multimodal biometric information fusion; reliability prediction; speaker identification; speaker verification; time-dependent fusion; time-varying stream; Biometrics; Computerized monitoring; Context modeling; Databases; Robustness; Speaker recognition; Streaming media; Testing; Training data;

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Multimedia and Expo, 2003. ICME '03. Proceedings. 2003 International Conference on

Print_ISBN :

0-7803-7965-9

Type :

conf

DOI :

10.1109/ICME.2003.1221235

Filename :

1221235

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1870437