• DocumentCode
    950769
  • Title

    Singing voice identification using spectral envelope estimation

  • Author

    Bartsch, Mark A. ; Wakefield, Gregory H.

  • Author_Institution
    Dept. of Electr. Eng. & Comput. Sci., Univ. of Michigan, Ann Arbor, MI, USA
  • Volume
    12
  • Issue
    2
  • fYear
    2004
  • fDate
    3/1/2004 12:00:00 AM
  • Firstpage
    100
  • Lastpage
    109
  • Abstract
    In this paper, we present a spectrum-based system for singer identification that operates for the ideal case in which audio samples contain only the singer´s voice. Our method begins with the computation of a robust estimate of the spectral envelope called the composite transfer function (CTF). The CTF is derived from the instantaneous amplitude and frequency of the sinusoidal partials which make up the vocal signal. Unlike traditional source-filter theory , the CTF does not explicitly separate the spectral characteristics of the vocal source and the vocal tract filter. The principal components of the CTFs are used as features for a quadratic classifier to identify singers. The approach is validated on a database containing samples from twelve classically trained singers. In cross validation experiments, test set accuracies of approximately 95% are found for a baseline case. The classifier´s performance is not degraded when different vowels are included in classifier training and evaluation. Restricting the frequency range of the CTFs and using a test set containing samples extracted from solo performances of Italian arias reduces the test set accuracy to 70-80%.
  • Keywords
    audio signal processing; signal sampling; speaker recognition; spectral analysis; transfer functions; Italian arias; classifier evaluation; classifier training; composite transfer function; quadratic classifier; singer identification; singing voice identification; source filter theory; spectral characteristics; spectral envelope estimation; spectrum based system; vocal signal; vocal source; vocal tract filter; Amplitude estimation; Filtering theory; Filters; Frequency estimation; Instruments; Music information retrieval; Robustness; Testing; Timbre; Transfer functions;
  • fLanguage
    English
  • Journal_Title
    Speech and Audio Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1063-6676
  • Type

    jour

  • DOI
    10.1109/TSA.2003.822637
  • Filename
    1284338