DocumentCode :
1437211
Title :
Joint Estimation of Chords and Downbeats From an Audio Signal
Author :
Papadopoulos, Hélène ; Peeters, Geoffroy
Author_Institution :
STMS, Sound Anal./Synthesis Team, IRCAM/CNRS-STMS, Paris, France
Volume :
19
Issue :
1
fYear :
2011
Firstpage :
138
Lastpage :
152
Abstract :
We present a new technique for joint estimation of the chord progression and the downbeats from an audio file. Musical signals are highly structured in terms of harmony and rhythm. In this paper, we intend to show that integrating knowledge of mutual dependencies between chords and metric structure allows us to enhance the estimation of these musical attributes. For this, we propose a specific topology of hidden Markov models that enables modelling chord dependence on metric structure. This model allows us to consider pieces with complex metric structures such as beat addition, beat deletion or changes in the meter. The model is evaluated on a large set of popular music songs from the Beatles that present various metric structures. We compare a semi-automatic model in which the beat positions are annotated, with a fully automatic model in which a beat tracker is used as a front-end of the system. The results show that the downbeat positions of a music piece can be estimated in terms of its harmonic structure and that conversely the chord progression estimation benefits from considering the interaction between the metric and the harmonic structures.
Keywords :
audio signal processing; hidden Markov models; music; audio file; audio signals; beat addition; beat deletion; beat tracker; chord progression; chord progression estimation; downbeats; hidden Markov model; metric structure; music harmony; music rhythm; music songs; musical signal estimation; Content based retrieval; Data mining; Electrical capacitance tomography; Hidden Markov models; Information analysis; Music information retrieval; Postal services; Rhythm; Signal synthesis; Topology; Chords; downbeat; hidden Markov model (HMM);
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2010.2045236
Filename :
5428860
Link To Document :
بازگشت