Title :
Transcribing Bach Chorales using non-negative matrix factorisation
Author :
Phon-Amnuaisuk, Somnuk
Author_Institution :
Music Informatic Res. Group, Univ. Tunku Abdul Rahman (UTAR), Petaling Jaya, Malaysia
Abstract :
This paper discusses our research on polyphonic music transcription using non-negative matrix factorization (NMF). The application of NMF in polyphonic transcription has two known limitations (i) the transcription output is a permutation of the input source signals (e.g., the following polyphonic input notes c, e, g and b may produce polyphonic output notes in the following order c, b, g and e) and (ii) the accuracy of the transcription depends on the accuracy of the factor r where r is the actual number of active pitches. This work proposes a novel approach by exploiting a tone model to tackle both the permutation of transcription output and the unknown factoring r issues. In our current implementation, the tone model is learned from the training data consisting of the pitches of the desired instrument. This approach offers an effective exploitation of the domain knowledge (i.e., tone model of each pitch). The empirical results show that the proposed tone-model initialised NMF (ICTM-NMF) could significantly improve the transcription output accuracy.
Keywords :
matrix decomposition; music; non-negative matrix factorisation; polyphonic music transcription; transcribing Bach Chorales; Encoding; Frequency domain analysis; Instruments; Mathematical model; Matrix decomposition; Noise; Time domain analysis;
Conference_Titel :
Audio Language and Image Processing (ICALIP), 2010 International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-5856-1
DOI :
10.1109/ICALIP.2010.5685059