• Title of article

    Modulation-scale analysis for content identification

  • Author/Authors

    S.، Sukittanon, نويسنده , , L.E.، Atlas, نويسنده , , J.W.، Pitton, نويسنده ,

  • Issue Information
    روزنامه با شماره پیاپی سال 2004
  • Pages
    -3022
  • From page
    3023
  • To page
    0
  • Abstract
    For nonstationary signal classification, e.g., speech or music, features are traditionally extracted from a time-shifted, yet short data window. For many applications, these short-term features do not efficiently capture or represent longer term signal variation. Partially motivated by human audition, we overcome the deficiencies of short-term features by employing modulation-scale analysis for long-term feature analysis. Our analysis, which uses time-frequency theory integrated with psychoacoustic results on modulation frequency perception, not only contains short-term information about the signals, but also provides long-term information representing patterns of time variation. This paper describes these features and their normalization. We demonstrate the effectiveness of our long-term features over conventional short-term features in content-based audio identification. A simulated study using a large data set, including nearly 10 000 songs and requiring over a billion audio pairwise comparisons, shows that modulation-scale features improves content identification accuracy substantially, especially when time and frequency distortions are imposed.
  • Keywords
    Evidence-based interventions , School psychology , Training , Exposure to interventions , Training challenges
  • Journal title
    IEEE TRANSACTIONS ON SIGNAL PROCESSING
  • Serial Year
    2004
  • Journal title
    IEEE TRANSACTIONS ON SIGNAL PROCESSING
  • Record number

    104744