• DocumentCode
    1697449
  • Title

    Subband autocorrelation features for video soundtrack classification

  • Author

    Cotton, Courtenay V. ; Ellis, Daniel P. W.

  • Author_Institution
    Dept. of Electr. Eng., Columbia Univ., New York, NY, USA
  • fYear
    2013
  • Firstpage
    8663
  • Lastpage
    8666
  • Abstract
    Inspired by the system presented in [1], we have developed novel auditory-model-based features that preserve the fine time structure lost in conventional frame-based features. While the original auditory model is computationally intense, we present a simpler system that runs about ten times faster but achieves equivalent performance. We use these features for video soundtrack classification with the Columbia Consumer Video dataset, showing that the new features alone are roughly comparable to traditional MFCCs, but combining classifiers based on both features achieves a substantial mean Average Precision improvement of 15% over the MFCC baseline.
  • Keywords
    acoustic signal processing; audio signal processing; cepstral analysis; video signal processing; Columbia consumer video dataset; MFCC baseline; auditory model; auditory-model-based features; conventional frame-based features; equivalent performance; subband autocorrelation features; substantial mean average precision improvement; video soundtrack classification; Correlation; Histograms; Learning systems; Mel frequency cepstral coefficient; Support vector machines; Time-frequency analysis; Vectors; Acoustic signal processing; Auditory models; Multimedia databases; Video indexing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
  • Conference_Location
    Vancouver, BC
  • ISSN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2013.6639357
  • Filename
    6639357