• DocumentCode
    41738
  • Title

    Harmonic Adaptive Latent Component Analysis of Audio and Application to Music Transcription

  • Author

    Fuentes, B. ; Badeau, Roland ; Richard, Guilhem

  • Author_Institution
    Dept. Traitement du Signal et des Images, Telecom ParisTech, Paris, France
  • Volume
    21
  • Issue
    9
  • fYear
    2013
  • fDate
    Sept. 2013
  • Firstpage
    1854
  • Lastpage
    1866
  • Abstract
    Recently, new methods for smart decomposition of time-frequency representations of audio have been proposed in order to address the problem of automatic music transcription. However those techniques are not necessarily suitable for notes having variations of both pitch and spectral envelope over time. The HALCA (Harmonic Adaptive Latent Component Analysis) model presented in this article allows considering those two kinds of variations simultaneously. Each note in a constant-Q transform is locally modeled as a weighted sum of fixed narrowband harmonic spectra, spectrally convolved with some impulse that defines the pitch. All parameters are estimated by means of the expectation-maximization (EM) algorithm, in the framework of Probabilistic Latent Component Analysis. Interesting priors over the parameters are also introduced in order to help the EM algorithm converging towards a meaningful solution. We applied this model for automatic music transcription: the onset time, duration and pitch of each note in an audio file are inferred from the estimated parameters. The system has been evaluated on two different databases and obtains very promising results.
  • Keywords
    audio signal processing; expectation-maximisation algorithm; principal component analysis; probability; EM algorithm; HALCA; audio file; audio harmonic adaptive latent component analysis; audio time-frequency representations; automatic music transcription; constant-Q transform; expectation-maximization algorithm; fixed narrowband harmonic spectra; music transcription; pitch envelope; probabilistic latent component analysis; smart decomposition; spectral envelope; Automatic transcription; multipitch estimation; nonnegative matrix factorization; probabilistic latent component analysis;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2013.2260741
  • Filename
    6510494