Title :
Distinguishing Monophonies From Polyphonies Using Weibull Bivariate Distributions
Author :
Lachambre, Hélène ; André-Obrecht, Régine ; Pinquier, Julien
Author_Institution :
IRIT, Univ. de Toulouse, Toulouse, France
Abstract :
In the context of music indexation, it would be useful to have a precise information about the number of sources performing; a source is a solo voice or an isolated instrument which produces a single note at any time. This correspondence discusses the automatic distinction between monophonic music excerpts, where only one source is present, and polyphonic ones. Our method is based on the analysis of a “confidence indicator,” which gives the confidence (in fact its inverse) on the current estimated fundamental frequency (pitch). In a monophony, the confidence indicator is low. In a polyphony, the confidence indicator is higher and varies more. This leads us to compute the short term mean and variance of this indicator, take this 2-D vector as the observation vector and model its conditional distribution with Weibull bivariate models. This probability density function is characterized by five parameters. A method to perform their estimation is developed (in theory and practice). The decision is taken considering the maximum likelihood, computed over one second. The best configuration gives a global error rate of 6.3%, performed on a balanced corpus (18 minutes in total).
Keywords :
Weibull distribution; audio signal processing; maximum likelihood estimation; music; 2D vector; Weibull bivariate distribution; Weibull bivariate model; confidence indicator; fundamental frequency; maximum likelihood; monophonic music excerpt; monophony; music indexation; observation vector; polyphony; probability density function; solo voice; Computational modeling; Context; Estimation; Histograms; Instruments; Shape; Training; Monophony; Weibull bivariate distribution; polyphony;
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TASL.2010.2089517