• DocumentCode
    1392590
  • Title

    Extracting Predominant Local Pulse Information From Music Recordings

  • Author

    Grosche, Peter ; Müller, Meinard

  • Author_Institution
    Max-Planck Inst. fur Inf., Saarland Univ., Saarbrucken, Germany
  • Volume
    19
  • Issue
    6
  • fYear
    2011
  • Firstpage
    1688
  • Lastpage
    1701
  • Abstract
    The extraction of tempo and beat information from music recordings constitutes a challenging task in particular for non-percussive music with soft note onsets and time-varying tempo. In this paper, we introduce a novel mid-level representation that captures musically meaningful local pulse information even for the case of complex music. Our main idea is to derive for each time position a sinusoidal kernel that best explains the local periodic nature of a previously extracted note onset representation. Then we employ an overlap-add technique accumulating all these kernels over time to obtain a single function that reveals the predominant local pulse (PLP). Our concept introduces a high degree of robustness to noise and distortions resulting from weak and blurry onsets. Furthermore, the resulting PLP curve reveals the local pulse information even in the presence of continuous tempo changes and indicates a kind of confidence in the periodicity estimation. As further contribution, we show how our PLP concept can be used as a flexible tool for enhancing tempo estimation and beat tracking. The practical relevance of our approach is demonstrated by extensive experiments based on music recordings of various genres.
  • Keywords
    audio recording; audio signal processing; music; beat information; music recordings; predominant local pulse information extraction; tempo information; Data mining; Estimation; Kernel; Multiple signal classification; Music; Speech; Speech processing; Audio feature; beat tracking; mid-level representation; music signal processing; musical pulse; onset detection; tempo estimation;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2010.2096216
  • Filename
    5654580