DocumentCode :
730337
Title :
A hybrid recurrent neural network for music transcription
Author :
Sigtia, Siddharth ; Benetos, Emmanouil ; Boulanger-Lewandowski, Nicolas ; Weyde, Tillman ; d´Avila Garcez, Artur S. ; Dixon, Simon
Author_Institution :
Centre for Digital Music, Queen Mary Univ. of London, London, UK
fYear :
2015
fDate :
19-24 April 2015
Firstpage :
2061
Lastpage :
2065
Abstract :
We investigate the problem of incorporating higher-level symbolic score-like information into Automatic Music Transcription (AMT) systems to improve their performance. We use recurrent neural networks (RNNs) and their variants as music language models (MLMs) and present a generative architecture for combining these models with predictions from a frame level acoustic classifier. We also compare different neural network architectures for acoustic modeling. The proposed model computes a distribution over possible output sequences given the acoustic input signal and we present an algorithm for performing a global search for good candidate transcriptions. The performance of the proposed model is evaluated on piano music from the MAPS dataset and we observe that the proposed model consistently outperforms existing transcription methods.
Keywords :
audio signal processing; music; recurrent neural nets; AMT systems; MLM; RNN; acoustic classifier; acoustic input signal; acoustic modeling; automatic music transcription; generative architecture; hybrid recurrent neural network; music language models; music transcription; neural network architectures; Acoustics; Computational modeling; Computer architecture; Hidden Markov models; Predictive models; Recurrent neural networks; Training; Music Language Models; Polyphonic Music Transcription; Recurrent Neural Networks;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
Type :
conf
DOI :
10.1109/ICASSP.2015.7178333
Filename :
7178333
Link To Document :
بازگشت