Title :
Audio-Based Video Genre Identification
Author :
Rouvier, Mickael ; Oger, Stanislas ; Linares, Georges ; Matrouf, Driss ; Merialdo, Bernard ; Li, Yingbo
Author_Institution :
Lab. Fondamental d´Inf. (LIF), Marseille, France
Abstract :
This paper presents investigations about the automatic identification of video genre by audio channel analysis. Genre refers to editorial styles such commercials, movies, sports... We propose and evaluate some methods based on both low and high level descriptors, in cepstral or time domains, but also by analyzing the global structure of the document and the linguistic contents. Then, the proposed features are combined and their complementarity is evaluated. On a database composed of single-stories web-videos, the best audio-only based system performs 9% of Classification Error Rate (CER). Finally, we evaluate the complementarity of the proposed audio features and video features that are classically used for Video Genre Identification (VGI). Results demonstrate the complementarity of the modalities for genre recognition, the final audio-video system reaching 6% CER.
Keywords :
audio signal processing; cepstral analysis; time-domain analysis; video signal processing; CER; VGI; audio channel analysis; audio features; audio-only based system; cepstral domains; classification error rate; commercials; document contents; global structure; high level descriptors; linguistic contents; low level descriptors; movies; single-stories Web-videos; sports; time domains; video features; video genre identification; Cepstral analysis; Feature extraction; Motion pictures; Pragmatics; Speech; Support vector machines; Automatic classification; linguistic feature extraction; video genre classification;
Journal_Title :
Audio, Speech, and Language Processing, IEEE/ACM Transactions on
DOI :
10.1109/TASLP.2014.2387411