• DocumentCode
    3168554
  • Title
    Probabilistic model for main melody extraction using Constant-Q transform
  • Author
    Fuentes, Benoit ; Liutkus, Antoine ; Badeau, Roland ; Richard, Gaël
  • Author_Institution
    Inst. Telecom, Telecom ParisTech, Paris, France
  • fYear
    2012
  • fDate
    25-30 March 2012
  • Firstpage
    5357
  • Lastpage
    5360
  • Abstract
    Dimension reduction techniques such as Nonnegative Tensor Factorization are now classical for both source separation and estimation of multiple fundamental frequencies in audio mixtures. Still, few studies have jointly addressed these tasks so far, mainly because separation is often based on the Short-Time Fourier Transform (STFT), whereas recent music analysis algorithms tend to be based on the Constant-Q Transform (CQT). The CQT is practical for pitch estimation because a pitch shift amounts to a translation of the CQT representation, whereas it produces a scaling of the STFT. Conversely, no simple inversion of the CQT was available until recently, which prevented its use for source separation. Benefiting from advances both in the inversion of the CQT and in statistical modeling, we show how recent techniques designed for music analysis can also be used for source separation with encouraging results, thus opening the path to many crossovers between separation and analysis.
  • Keywords
    Fourier transforms; audio signal processing; estimation theory; music; source separation; statistical analysis; CQT representation; STFT; audio mixtures; constant-Q transform; dimension reduction techniques; main melody extraction; multiple fundamental frequencies; music analysis algorithms; pitch estimation; pitch shift; probabilistic model; short term Fourier transform; statistical modeling; Analytical models; Estimation; Harmonic analysis; Instruments; Source separation; Time frequency analysis; Transforms; CQT; NTF; PLCA; audio source separation
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
  • Conference_Location
    Kyoto
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4673-0045-2
  • Electronic_ISBN
    1520-6149
  • Type
    conf
  • DOI
    10.1109/ICASSP.2012.6289131
  • Filename
    6289131