Title :
Polyphonic transcription based on temporal evolution of spectral similarity of gaussian mixture models
Author :
Canadas-Quesada, F.J. ; Vera-Candeas, P. ; Ruiz-Reyes, N. ; Carabias-Orti, J.J.
Author_Institution :
Telecommun. Eng., Univ. of Jaen, Linares, Spain
Abstract :
This paper describes a system to transcribe multitimbral polyphonic music based on a joint multiple-F0 estimation. In a frame level, all possible fundamental frequency (F0) candidates are selected. Using a competitive strategy, a spectral envelope is estimated for each combination composed of F0 candidates under assumption that a polyphonic sound can be modeled by a sum of weighted gaussian mixture models (GMM). Since in polyphonic music the current spectral content depends to a large extent of the immediately previous one, the winner combination is determined taking into account the highest spectral similarity regarding to the past music events which has been selected from a set of combinations that minimize the current spectral distance between input-GMM spectrums. Our system was tested using several pieces of real-world music recordings from RWC Music Database. Evaluation shows encouraging results compared to a recent state-of-the-art method.
Keywords :
Gaussian processes; audio signal processing; mixture models; music; spectral analysis; RWC Music Database; joint multiple-F0 estimation; multitimbral polyphonic music; polyphonic sound; polyphonic transcription; real-world music recordings; spectral content; spectral distance; spectral envelope; spectral similarity; temporal evolution; weighted Gaussian mixture models; Estimation; Harmonic analysis; Hidden Markov models; Instruments; Multiple signal classification; Music; Speech;
Conference_Titel :
Signal Processing Conference, 2009 17th European
Conference_Location :
Glasgow
Print_ISBN :
978-161-7388-76-7