Title :
A discriminative approach to polyphonic piano note transcription using supervised non-negative matrix factorization
Author :
Weninger, Felix ; Kirst, Christian ; Schuller, Bjorn ; Bungartz, Hans-Joachim
Author_Institution :
Machine Intell. & Signal Process. Group, Tech. Univ. Munchen, München, Germany
Abstract :
We introduce a novel method for the transcription of polyphonic piano music by discriminative training of support vector machines (SVMs). As features, we use pitch activations computed by supervised non-negative matrix factorization from low-level spectral features. Different approaches to low-level feature extraction, NMF dictionary learning and activation feature extraction are analyzed in a large-scale evaluation on eight hours of piano music including synthesized and real recordings. We conclude that the proposed method delivers state-of-the-art results and clearly outperforms SVMs using simple spectral features.
Keywords :
acoustic signal processing; feature extraction; information retrieval; learning (artificial intelligence); matrix decomposition; music; support vector machines; NMF dictionary learning; SVMs; activation feature extraction; discriminative training approach; large-scale evaluation; low-level spectral feature extraction; piano music; pitch activations; polyphonic piano note transcription; supervised nonnegative matrix factorization; support vector machines; Accuracy; Databases; Dictionaries; Feature extraction; Instruments; Support vector machines; Training; Transcription; music information retrieval; non-negative matrix factorization; sparse coding;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
DOI :
10.1109/ICASSP.2013.6637598