• DocumentCode
    695729
  • Title

    Structure-aware dictionary learning with harmonic atoms

  • Author

    O´Hanlon, Ken ; Plumbley, Mark D.

  • Author_Institution
    Centre for Digital Music, Queen Mary, Univ. of London, London, UK
  • fYear
    2011
  • fDate
    Aug. 29 2011-Sept. 2 2011
  • Firstpage
    1761
  • Lastpage
    1765
  • Abstract
    Non-negative blind signal decomposition methods are widely used for musical signal processing tasks, such as automatic transcription and source separation. A spectrogram can be decomposed into a dictionary of full spectrum basis atoms and their corresponding time activation vectors using methods such as Non-negative Matrix Factorisation (NMF) and Non-negative K-SVD (NN-K-SVD). These methods are constrained by their learning order and problems posed by overlapping sources in the time and frequency domains of the source spectrogram. We consider that it may be possible to improve on current results by providing prior knowledge on the number of sources in a given spectrogram and on the individual structure of the basis atoms, an approach we refer to as structure-aware dictionary learning. In this work we consider dictionary recoverability of harmonic atoms, as harmonicity is a common structure in music signals. We present results showing improvements in recoverability using structure-aware decomposition methods, based on NN-K-SVD and NMF. Finally we propose an alternative structure-aware dictionary learning algorithm incorporating the advantages of NMF and NN-K-SVD.
  • Keywords
    audio signal processing; blind source separation; frequency-domain analysis; singular value decomposition; time-domain analysis; NMF; NN-K-SVD; frequency domains; harmonic atom dictionary recoverability; musical signal processing; nonnegative K-SVD; nonnegative blind signal decomposition method; nonnegative matrix factorisation; source spectrogram; structure-aware dictionary learning; time activation vectors; time domains; Atomic clocks; Dictionaries; Encoding; Harmonic analysis; Matching pursuit algorithms; Spectrogram;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference, 2011 19th European
  • Conference_Location
    Barcelona
  • ISSN
    2076-1465
  • Type

    conf

  • Filename
    7074279