• DocumentCode
    3143211
  • Title

    Explicit beat structure modeling for non-negative matrix factorization-based multipitch analysis

  • Author

    Ochiai, Kazuki ; Kameoka, Hirokazu ; Sagayama, Shigeki

  • Author_Institution
    Grad. Sch. of Inf. Sci. & Technol., Univ. of Tokyo, Tokyo, Japan
  • fYear
    2012
  • fDate
    25-30 March 2012
  • Firstpage
    133
  • Lastpage
    136
  • Abstract
    This paper proposes model-based non-negative matrix factorization (NMF) for estimating basis spectra and activations, detecting note onsets and offsets, and determining beat locations, simultaneously. Multipitch analysis is a process of detecting the pitch and onset of each note from a musical signal. Conventional NMF-based approaches often lead to unsatisfactory results very possibly due to the lack of musically meaningful constraints. As music is highly structured in terms of the temporal regularity underlying the onset occurrences of notes, we use this rhythmic structure to constrain NMF by parametrically modeling each note activation with a Gaussian mixture and derive an algorithm for iteratively updating model parameters. It is experimentally shown that the proposed model outperforms the standard NMF algorithms as regards onset detection rate.
  • Keywords
    electronic music; matrix decomposition; music; Gaussian mixture; NMF-based approaches; basis spectra estimation; explicit beat structure modeling; model-based nonnegative matrix factorization; musical signal; nonnegative matrix factorization-based multipitch analysis; note activation; onset detection rate; rhythmic structure; Databases; Hidden Markov models; Instruments; Music; Sparse matrices; Spectrogram; Standards; Non-negative matrix factorization; Onset detection; Polyphonic pitch transcription; Rhythmic/Beat structure;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
  • Conference_Location
    Kyoto
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4673-0045-2
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2012.6287835
  • Filename
    6287835