مرکز منطقه ای اطلاع رساني علوم و فناوري - Benchmarking methods for audio-visual recognition using tiny training sets

DocumentCode :

1668688

Title :

Benchmarking methods for audio-visual recognition using tiny training sets

Author :

Alameda-Pineda, Xavier ; Sanchez-Riera, Jordi ; Horaud, Radu

Author_Institution :

INRIA Grenoble Rhone-Alpes, Grenoble, France

fYear :

2013

Firstpage :

3662

Lastpage :

3666

Abstract :

The problem of choosing a classifier for audio-visual command recognition is addressed. Because such commands are culture- and user-dependant, methods need to learn new commands from a few examples. We benchmark three state-of-the-art discriminative classifiers based on bag of words and SVM. The comparison is made on monocular and monaural recordings of a publicly available dataset. We seek for the best trade off between speed, robustness and size of the training set. In the light of over 150,000 experiments, we conclude that this is a promising direction of work towards a flexible methodology that must be easily adaptable to a large variety of users.

Keywords :

audio signal processing; image classification; speech recognition; support vector machines; SVM; audio-visual command recognition classifier; bag of words; benchmarking methods; culture-dependant; discriminative classifiers; monaural recordings; monocular recordings; tiny training sets; user-dependant; Accuracy; Benchmark testing; Kernel; Robots; Robustness; Training; Visualization; Audio-visual classification; command recognition; tiny training sets;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on

Conference_Location :

Vancouver, BC

ISSN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2013.6638341

Filename :

6638341

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1668688