DocumentCode :
1453159
Title :
An End-to-End Machine Learning System for Harmonic Analysis of Music
Author :
Ni, Yizhao ; McVicar, Matt ; Santos-Rodríguez, Raúl ; De Bie, Tijl
Author_Institution :
Dept. of Eng. Math., Univ. of Bristol, Bristol, UK
Volume :
20
Issue :
6
fYear :
2012
Firstpage :
1771
Lastpage :
1783
Abstract :
We present a new system for the harmonic analysis of popular musical audio. It is focused on chord estimation, although the proposed system additionally estimates the key sequence and bass notes. It is distinct from competing approaches in two main ways. First, it makes use of a new improved chromagram representation of audio that takes the human perception of loudness into account. Furthermore, it is the first system for joint estimation of chords, keys, and bass notes that is fully based on machine learning, requiring no expert knowledge to tune the parameters. This means that it will benefit from future increases in available annotated audio files, broadening its applicability to a wider range of genres. In all of three evaluation scenarios, including a new one that allows evaluation on audio for which no complete ground truth annotation is available, the proposed system is shown to be faster, more memory efficient, and more accurate than the state-of-the-art.
Keywords :
audio signal processing; harmonic analysis; learning (artificial intelligence); annotated audio files; audio representation; chord estimation; chromagram representation; end-to-end machine learning system; harmonic analysis; music; Harmonic analysis; Hidden Markov models; Humans; Maximum likelihood estimation; Topology; Vectors; Audio chord estimation; harmony progression analyzer (HPA); loudness-based chromagram; machine learning; meta-song evaluation;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2012.2188516
Filename :
6155600
Link To Document :
بازگشت