Title :
Partial clustering using a time-varying frequency model for singing voice detection
Author :
Regnier, L. ; Peeters, G.
Author_Institution :
IRCAM, CNRS, Paris, France
Abstract :
We propose a new method to group partials produced by each instrument of a polyphonic audio mixture. This method works for pitched and harmonic instruments and is specially adapted to singing voice. In our approach, we model time-varying frequencies of partials as a slowly varying frequency plus a sinusoidal modulation. The parameters obtained with this model plus some common Auditory Scene Analysis principles are used to define a similarity measure between partials. This multi-criterion based measure is then used to build the input similarity matrix of a clustering algorithm. Clusters obtained are groups of harmonically related partials. We evaluate the ability of our method to group partials per source when one of the sources is a singing voice. We show that partial clustering is a promising approach for singing voice detection and separation.
Keywords :
acoustic signal detection; music; pattern clustering; source separation; speech; speech processing; speech recognition; auditory scene analysis principle; harmonic instruments; input similarity matrix; legato; partial clustering; pitched instruments; polyphonic audio mixture; portamento; singing voice detection; sinusoidal modulation; slowly varying frequency; time-varying frequency model; vibrato; Auditory system; Clustering algorithms; Data mining; Frequency; Image analysis; Independent component analysis; Instruments; Music information retrieval; Source separation; Speech analysis; Polyphonic music analysis; Singing voice detection; Source separation; Vibrato detection;
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2010.5495744