• DocumentCode
    730078
  • Title

    Multi-instrument detection in polyphonic music using Gaussian Mixture based factorial HMM

  • Author

    Ranjani, H.G. ; Sreenivas, T.V.

  • Author_Institution
    Dept. of ECE., Indian Inst. of Sci., Bangalore, India
  • fYear
    2015
  • fDate
    19-24 April 2015
  • Firstpage
    191
  • Lastpage
    195
  • Abstract
    We formulate the problem of detecting the constituent instruments in a polyphonic music piece as a joint decoding problem. From monophonic data, parametric Gaussian Mixture Hidden Markov Models (GM-HMM) are obtained for each instrument. We propose a method to use the above models in a factorial framework, termed as Factorial GM-HMM (F-GM-HMM). The states are jointly inferred to explain the evolution of each instrument in the mixture observation sequence. The dependencies are decoupled using variational inference technique. We show that the joint time evolution of all instruments´ states can be captured using F-GM-HMM. We compare performance of proposed method with that of Student´s-t mixture model (tMM) and GM-HMM in an existing latent variable framework. Experiments on two to five polyphony with 8 instrument models trained on the RWC dataset, tested on RWC and TRIOS datasets show that F-GM-HMM gives an advantage over the other considered models in segments containing co-occurring instruments.
  • Keywords
    Gaussian processes; acoustic signal detection; hidden Markov models; mixture models; music; musical instruments; variational techniques; F-GM-HMM; Gaussian mixture based factorial HMM; Gaussian mixture hidden Markov model; RWC dataset; TRIOS datasets; constituent instrument detection; factorial GM-HMM; joint decoding problem; joint time evolution; mixture observation sequence; monophonic data; multi-instrument detection; parametric GM-HMM; polyphonic music; variational inference technique; Data models; Hidden Markov models; Instruments; Music; Speech; System-on-chip; Yttrium; FGM-HMM; Factorial HMM; Latent Variable; Polyphony;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
  • Conference_Location
    South Brisbane, QLD
  • Type

    conf

  • DOI
    10.1109/ICASSP.2015.7177958
  • Filename
    7177958