DocumentCode :
1400945
Title :
Simultaneous Beat and Downbeat-Tracking Using a Probabilistic Framework: Theory and Large-Scale Evaluation
Author :
Peeters, Geoffroy ; Papadopoulos, Helene
Author_Institution :
STMS, Sound Anal./Synthesis Team, IRCAM, Paris, France
Volume :
19
Issue :
6
fYear :
2011
Firstpage :
1754
Lastpage :
1769
Abstract :
This paper deals with the simultaneous estimation of beat and downbeat location in an audio-file. We propose a probabilistic framework in which the time of the beats and their associated beat-position-inside-a-bar roles; hence, the downbeats, are considered as hidden states and are estimated simultaneously using signal observations. For this, we propose a “reverse” Viterbi algorithm which decodes hidden states over beat-numbers. A beat-template is used to derive the beat observation probabilities. For this task, we propose the use of a machine-learning method, the Linear Discriminant Analysis, to estimate the most discriminative beat-templates. We propose two functions to derive the beat-position-inside-a-bar observation probability: the variation over time of chroma vectors and the spectral balance. We then perform a large-scale evaluation of beat and downbeat-tracking using six test-sets. In this, we study the influence of the various parameters of our method, compare this method to our previous beat and downbeat-tracking algorithms, and compare our results to state-of-the-art results on two test-sets for which results have been published. We finally discuss the results obtained by our system in the MIREX-09 and MIREX-10 contests for which our system ranked among the first for the “McKinney Collection” test-set.
Keywords :
audio signal processing; learning (artificial intelligence); probability; audio-file; beat observation probabilities; beat tracking; beat-position-inside-a-bar observation probability; beat-template; chroma vectors; downbeat-tracking; large-scale evaluation; linear discriminant analysis; machine-learning method; probabilistic framework; reverse Viterbi algorithm; simultaneous beat; simultaneous estimation; spectral balance; Algorithm design and analysis; Estimation; Hidden Markov models; Probabilistic logic; Speech; Speech processing; Viterbi algorithm; Beat-templates; beat-tracking; downbeat-tracking; hidden Markov model (HMM); linear discriminant analysis (LDA); reverse Viterbi decoding;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2010.2098869
Filename :
5664773
Link To Document :
بازگشت