مرکز منطقه ای اطلاع رساني علوم و فناوري - Distinguishing Monophonies From Polyphonies Using Weibull Bivariate Distributions

DocumentCode :

1361336

Title :

Distinguishing Monophonies From Polyphonies Using Weibull Bivariate Distributions

Author :

Lachambre, Hélène ; André-Obrecht, Régine ; Pinquier, Julien

Author_Institution :

IRIT, Univ. de Toulouse, Toulouse, France

Volume :

Issue :

fYear :

2011

Firstpage :

1837

Lastpage :

1842

Abstract :

In the context of music indexation, it would be useful to have a precise information about the number of sources performing; a source is a solo voice or an isolated instrument which produces a single note at any time. This correspondence discusses the automatic distinction between monophonic music excerpts, where only one source is present, and polyphonic ones. Our method is based on the analysis of a “confidence indicator,” which gives the confidence (in fact its inverse) on the current estimated fundamental frequency (pitch). In a monophony, the confidence indicator is low. In a polyphony, the confidence indicator is higher and varies more. This leads us to compute the short term mean and variance of this indicator, take this 2-D vector as the observation vector and model its conditional distribution with Weibull bivariate models. This probability density function is characterized by five parameters. A method to perform their estimation is developed (in theory and practice). The decision is taken considering the maximum likelihood, computed over one second. The best configuration gives a global error rate of 6.3%, performed on a balanced corpus (18 minutes in total).

Keywords :

Weibull distribution; audio signal processing; maximum likelihood estimation; music; 2D vector; Weibull bivariate distribution; Weibull bivariate model; confidence indicator; fundamental frequency; maximum likelihood; monophonic music excerpt; monophony; music indexation; observation vector; polyphony; probability density function; solo voice; Computational modeling; Context; Estimation; Histograms; Instruments; Shape; Training; Monophony; Weibull bivariate distribution; polyphony;

fLanguage :

English

Journal_Title :

Audio, Speech, and Language Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1558-7916

Type :

jour

DOI :

10.1109/TASL.2010.2089517

Filename :

5610710

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1361336