DocumentCode :
730129
Title :
A dynamic programming variant of non-negative matrix deconvolution for the transcription of struck string instruments
Author :
Ewert, Sebastian ; Plumbley, Mark D. ; Sandler, Mark
Author_Institution :
Queen Mary Univ. of London, London, UK
fYear :
2015
fDate :
19-24 April 2015
Firstpage :
569
Lastpage :
573
Abstract :
Given a musical audio recording, the goal of music transcription is to determine a score-like representation of the piece underlying the recording. Most current transcription methods employ variants of non-negative matrix factorization (NMF), which often fails to robustly model instruments producing non-stationary sounds. Using entire time-frequency patterns to represent sounds, non-negative matrix deconvolution (NMD) can capture certain types of non-stationary behavior but is only applicable if all sounds have the same length. In this paper, we present a novel method that combines the non-stationarity modeling capabilities available with NMD with the variable note lengths possible with NMF. Identifying frames in NMD patterns with states in a dynamical system, our method iteratively generates sound-object candidates separately for each pitch, which are then combined in a global optimization. We demonstrate the transcription capabilities of our method using piano pieces assuming the availability of single note recordings as training data.
Keywords :
audio signal processing; deconvolution; dynamic programming; matrix decomposition; music; dynamic programming; musical audio recording; nonnegative matrix deconvolution; nonnegative matrix factorization; nonstationarity modeling; score like representation; single note recordings; struck string instrument transcription; time frequency patterns; variable note lengths; Acoustics; Deconvolution; Hidden Markov models; Instruments; Signal processing; Speech; Time-frequency analysis; Convolutive Signal Models; Dynamical Systems; Music Transcription; Non-Negative Matrix Deconvolution;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
Type :
conf
DOI :
10.1109/ICASSP.2015.7178033
Filename :
7178033
Link To Document :
بازگشت