• DocumentCode
    865929
  • Title

    Drum Sound Recognition for Polyphonic Audio Signals by Adaptation and Matching of Spectrogram Templates With Harmonic Structure Suppression

  • Author

    Yoshii, Kazuyoshi ; Goto, Masataka ; Okuno, Hiroshi G.

  • Author_Institution
    Dept. of Intelligence Sci. & Technol., Kyoto Univ.
  • Volume
    15
  • Issue
    1
  • fYear
    2007
  • Firstpage
    333
  • Lastpage
    345
  • Abstract
    This paper describes a system that detects onsets of the bass drum, snare drum, and hi-hat cymbals in polyphonic audio signals of popular songs. Our system is based on a template-matching method that uses power spectrograms of drum sounds as templates. This method calculates the distance between a template and each spectrogram segment extracted from a song spectrogram, using Goto´s distance measure originally designed to detect the onsets in drums-only signals. However, there are two main problems. The first problem is that appropriate templates are unknown for each song. The second problem is that it is more difficult to detect drum-sound onsets in sound mixtures including various sounds other than drum sounds. To solve these problems, we propose template-adaptation and harmonic-structure-suppression methods. First of all, an initial template of each drum sound, called a seed template, is prepared. The former method adapts it to actual drum-sound spectrograms appearing in the song spectrogram. To make our system robust to the overlapping of harmonic sounds with drum sounds, the latter method suppresses harmonic components in the song spectrogram before the adaptation and matching. Experimental results with 70 popular songs showed that our template-adaptation and harmonic-structure-suppression methods improved the recognition accuracy and achieved 83%, 58%, and 46% in detecting onsets of the bass drum, snare drum, and hi-hat cymbals, respectively
  • Keywords
    acoustic signal processing; audio acoustics; audio signal processing; harmonics suppression; musical instruments; bass drum; drum sound recognition; drum-sound spectrograms; harmonic sounds; harmonic structure suppression; harmonic-structure-suppression methods; hi-hat cymbals; polyphonic audio signals; snare drum; song spectrogram; spectrogram templates; template-matching method; Acoustic signal detection; Adaptive signal detection; Content based retrieval; Information analysis; Instruments; Multiple signal classification; Music information retrieval; Rhythm; Signal design; Spectrogram; Drum sound recognition; harmonic structure suppression; polyphonic audio signal; spectrogram template; template adaptation; template matching;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2006.876754
  • Filename
    4032798