Regional Information Center for Science and Technology (RICeST) - Crossmodal Matching of Speakers Using Lip and Voice Features in Temporally Non-overlapping Audio and Video Streams

DocumentCode :

2506636

Title :

Crossmodal Matching of Speakers Using Lip and Voice Features in Temporally Non-overlapping Audio and Video Streams

Author :

Roy, Anindya ; Marcel, Sébastien

Author_Institution :

Idiap Res. Inst., Martigny, Switzerland

fYear :

2010

fDate :

23-26 Aug. 2010

Firstpage :

4504

Lastpage :

4507

Abstract :

Person identification using audio (speech) and visual (facial appearance, static or dynamic) modalities, either independently or jointly, is a thoroughly investigated problem in pattern recognition. In this work, we explore a novel task: person identification in a cross-modal scenario, i.e., matching the speaker in an audio recording to the same speaker in a video recording, where the two recordings have been made during different sessions, using speaker-specific information which is common to both the audio and video modalities. Several recent psychological studies have shown that humans can indeed perform this task with an accuracy significantly higher than chance. Here we propose two systems which can solve this task comparably well, using purely pattern recognition techniques. We hypothesize that such systems could be put to practical use in multimodal biometric and surveillance systems.

Keywords :

audio recording; biometrics (access control); pattern recognition; speaker recognition; video recording; video streaming; video surveillance; audio streams; crossmodal matching; lip features; multimodal biometric systems; person identification; speakers; surveillance systems; video streams; voice features; Feature extraction; Humans; Observers; Speech; Synchronization; Visualization; Multi-modal biometrics; audio and video classification; audio-visual speaker recognition

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Pattern Recognition (ICPR), 2010 20th International Conference on

Conference_Location :

Istanbul

ISSN :

1051-4651

Print_ISBN :

978-1-4244-7542-1

Type :

conf

DOI :

10.1109/ICPR.2010.1094

Filename :

5597384

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2506636