Title :
Unsupervised feature extraction for multimedia event detection and ranking using audio content
Author :
Amid, Ehsan ; Mesaros, Annamaria ; Palomaki, Kalle J. ; Laaksonen, Jorma ; Kurimo, Mikko
Author_Institution :
Dept. of Signal Process. & Acoust., Aalto Univ., Espoo, Finland
Abstract :
In this paper, we propose a new approach to classify and rank multimedia events based purely on audio content using video data from TRECVID-2013 multimedia event detection (MED) challenge. We perform several layers of nonlinear mappings to extract a set of unsupervised features from an initial set of temporal and spectral features to obtain a superior presentation of the atomic audio units. Additionally, we propose a novel weighted divergence measure for kernel based classifiers. The extensive set of experiments confirms that augmentation of the proposed steps results in an improved accuracy for most of the event classes.
Keywords :
audio signal processing; audio streaming; feature extraction; multimedia systems; signal classification; spectral analysis; unsupervised learning; TRECVID-2013; atomic audio units; audio content; kernel based classifier; multimedia event classification; multimedia event detection; multimedia event ranking; nonlinear mapping; spectral features; temporal features; unsupervised feature extraction; video data; weighted divergence measure; Event detection; Feature extraction; Histograms; Multimedia communication; Noise reduction; Speech; Vectors; Bag of Words; Multimedia Event Detection; Stacked Denoising Autoencoders; Term Weighting; Unsupervised Feature Extraction; Weighted Jensen-Shannon Divergence;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
DOI :
10.1109/ICASSP.2014.6854743