DocumentCode
2472047
Title
Clip retrieval using multi-modal biometrics in meeting archives
Author
Vajaria, Himanshu ; Sarkar, Sudeep ; Kasturi, Rangachar
Author_Institution
Dept. of Comput. Sci. & Eng., Univ. of South Florida, Tampa, FL, USA
fYear
2008
fDate
8-11 Dec. 2008
Firstpage
1
Lastpage
4
Abstract
We present a system to retrieve all clips from a meeting archive that show a particular individual speaking, using a single face or voice sample as the query. The system incorporates three novel ideas. One, rather than match the query to each individual sample in the archive, samples within a meeting are grouped first, generating a cluster of samples per individual. The query is then matched to the cluster, taking advantage of multiple samples to yield a robust decision. Two, automatic audio-visual association is performed which allows a bi-modal retrieval of clips, even when the query is uni-modal. Three, the biometric recognition uses individual-specific score distributions learnt from the clusters, in a likelihood ratio based decision framework that obviates the need for explicit normalization or modality weighting. The resulting system, which is completely automated, performs with 92.6% precision at 90% recall on a dataset of 16 real meetings spanning a total of 13 hours.
Keywords
audio-visual systems; biometrics (access control); face recognition; image matching; image sampling; information retrieval systems; learning (artificial intelligence); pattern clustering; speaker recognition; video retrieval; automatic audio-visual association; clip retrieval; individual-specific score distribution; meeting archive; multimodal biometric recognition; query matching; robust decision framework; speaker face recognition; Biometrics; Cameras; Computer science; Face detection; Mutual information; Polynomials; Robustness; Speech; Support vector machines; Video recording;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition, 2008. ICPR 2008. 19th International Conference on
Conference_Location
Tampa, FL
ISSN
1051-4651
Print_ISBN
978-1-4244-2174-9
Electronic_ISBN
1051-4651
Type
conf
DOI
10.1109/ICPR.2008.4760962
Filename
4760962
Link To Document