Title :
Improved musical onset detection with Convolutional Neural Networks
Author :
Schluter, Jan ; Bock, Sebastian
Author_Institution :
Austrian Res. Inst. for Artificial Intell., Vienna, Austria
Abstract :
Musical onset detection is one of the most elementary tasks in music analysis, but still only solved imperfectly for polyphonic music signals. Interpreted as a computer vision problem in spectrograms, Convolutional Neural Networks (CNNs) seem to be an ideal fit. On a dataset of about 100 minutes of music with 26k annotated onsets, we show that CNNs outperform the previous state-of-the-art while requiring less manual preprocessing. Investigating their inner workings, we find two key advantages over hand-designed methods: Using separate detectors for percussive and harmonic onsets, and combining results from many minor variations of the same scheme. The results suggest that even for well-understood signal processing tasks, machine learning can be superior to knowledge engineering.
Keywords :
audio signal processing; information retrieval; learning (artificial intelligence); music; neural nets; CNN; computer vision problem; convolutional neural networks; hand-designed methods; harmonic onsets; improved musical onset detection; knowledge engineering; machine learning; music analysis; percussive onsets; polyphonic music signals; signal processing tasks; spectrograms; Computer architecture; Convolution; Detectors; Music information retrieval; Neural networks; Spectrogram; Training; Multi-layer neural network; Music information retrieval;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
DOI :
10.1109/ICASSP.2014.6854953