DocumentCode :
698013
Title :
Singing voice detection in monophonic and polyphonic contexts
Author :
Lachambre, Helene ; Andre-Obrecht, Regine ; Pinquier, Julien
Author_Institution :
IRIT - Univ. de Toulouse, Narbonne, France
fYear :
2009
fDate :
24-28 Aug. 2009
Firstpage :
1344
Lastpage :
1348
Abstract :
In this article, we present an improvement of a previous singing voice detector. This new detector is in two steps. First, we distinguish monophonies from polyphonies. This distinction is based on the fact that the pitch estimated in a monophony is more reliable than the one estimated in a polyphony. We study the short term mean and variance of a confidence indicator; their repartition is modelled with bivariate Weibull distributions. We present a new method to estimate the parameters of these distributions with the moment method. Then, we detect the presence of singing voice. This is done by looking for the presence of vibrato, an oscillation of the fundamental frequency between 4 and 8 Hz. In a monophonic context, we look for vibrato on the pitch. In a polyphonic context, we first make a frequency tracking on the whole spectrogram, and then look for vibrato on each frequency tracks. Results are promising: from a global error rate of 29.7 % (previous method), we fall to a global error rate of 25 %. This means that taking into account the context (monophonic or polyphonic) leads to a relative gain of more than 16 %.
Keywords :
music; speech recognition; bivariate Weibull distribution; confidence indicator; moment method; monophonic context; polyphonic context; singing voice detection; spectrogram; vibrato; Computational modeling; Context; Detectors; Error analysis; Harmonic analysis; Instruments; Reliability;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing Conference, 2009 17th European
Conference_Location :
Glasgow
Print_ISBN :
978-161-7388-76-7
Type :
conf
Filename :
7077587
Link To Document :
بازگشت