• DocumentCode
    1973049
  • Title

    To catch a chorus: using chroma-based representations for audio thumbnailing

  • Author

    Bartsch, Mark A. ; Wakefield, Gregory H.

  • Author_Institution
    Dept. of Electr. Eng. & Comput. Sci., Michigan Univ., Ann Arbor, MI, USA
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    15
  • Lastpage
    18
  • Abstract
    An important application for use with multimedia databases is a browsing aid, which allows a user to quickly and efficiently preview selections from either a database or from the results of a database query. Methods for facilitating browsing, though, are necessarily media dependent. We present one such method that produces short, representative samples (or "audio thumbnails") of selections of popular music. This method attempts to identify the chorus or refrain of a song by identifying repeated sections of the audio waveform. A reduced spectral representation of the selection based on a chroma transformation of the spectrum is used to find repeating patterns. This representation encodes harmonic relationships in a signal and thus is ideal for popular music, which is often characterized by prominent harmonic progressions. The method is evaluated over a sizable database of popular music and found to perform well, with most of the errors resulting from songs that do not meet our structural assumptions
  • Keywords
    audio signal processing; information retrieval; multimedia databases; music; pattern recognition; spectral analysis; audio thumbnail; browsing aid; chroma transformation; chroma-based representations; chromagram; harmonic progressions; multimedia databases; repeating patterns; retrieval systems; selection previews; song chorus; song refrain; Audio databases; Costs; Image databases; Marine vehicles; Multimedia databases; Multimedia systems; Multiple signal classification; Performance evaluation; Sampling methods; Speech;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Applications of Signal Processing to Audio and Acoustics, 2001 IEEE Workshop on the
  • Conference_Location
    New Platz, NY
  • Print_ISBN
    0-7803-7126-7
  • Type

    conf

  • DOI
    10.1109/ASPAA.2001.969531
  • Filename
    969531