DocumentCode
1697449
Title
Subband autocorrelation features for video soundtrack classification
Author
Cotton, Courtenay V. ; Ellis, Daniel P. W.
Author_Institution
Dept. of Electr. Eng., Columbia Univ., New York, NY, USA
fYear
2013
Firstpage
8663
Lastpage
8666
Abstract
Inspired by the system presented in [1], we have developed novel auditory-model-based features that preserve the fine time structure lost in conventional frame-based features. While the original auditory model is computationally intense, we present a simpler system that runs about ten times faster but achieves equivalent performance. We use these features for video soundtrack classification with the Columbia Consumer Video dataset, showing that the new features alone are roughly comparable to traditional MFCCs, but combining classifiers based on both features achieves a substantial mean Average Precision improvement of 15% over the MFCC baseline.
Keywords
acoustic signal processing; audio signal processing; cepstral analysis; video signal processing; Columbia consumer video dataset; MFCC baseline; auditory model; auditory-model-based features; conventional frame-based features; equivalent performance; subband autocorrelation features; substantial mean average precision improvement; video soundtrack classification; Correlation; Histograms; Learning systems; Mel frequency cepstral coefficient; Support vector machines; Time-frequency analysis; Vectors; Acoustic signal processing; Auditory models; Multimedia databases; Video indexing;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location
Vancouver, BC
ISSN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2013.6639357
Filename
6639357
Link To Document