DocumentCode :
2357882
Title :
Bayesian nonparametric spectrogram modeling based on infinite factorial infinite hidden Markov model
Author :
Nakano, Masahiro ; Le Roux, Jonathan ; Kameoka, Hirokazu ; Nakamura, Tomohiko ; Ono, Nobutaka ; Sagayama, Shigeki
Author_Institution :
Grad. Sch. of Inf. Sci. & Technol., Univ. of Tokyo, Tokyo, Japan
fYear :
2011
fDate :
16-19 Oct. 2011
Firstpage :
325
Lastpage :
328
Abstract :
This paper presents a Bayesian nonparametric latent source discovery method for music signal analysis. In audio signal analysis, an important goal is to decompose music signals into individual notes, with applications such as music transcription, source separation or note-level manipulation. Recently, the use of latent variable decompositions, especially nonnegative matrix factorization (NMF), has been a very active area of research. These methods are facing two, mutually dependent, problems: first, instrument sounds often exhibit time-varying spectra, and grasping this time-varying nature is an important factor to characterize the diversity of each instrument; moreover, in many cases we do not know in advance the number of sources and which instruments are played. Conventional decompositions generally fail to cope with these issues as they suffer from the difficulties of automatically determining the number of sources and automatically grouping spectra into single events. We address both these problems by developing a Bayesian nonparametric fusion of NMF and hidden Markov model (HMM). Our model decomposes music spectrograms in an automatically estimated number of components, each of which consisting in an HMM whose number of states is also automatically estimated from the data.
Keywords :
Bayes methods; audio signal processing; hidden Markov models; matrix decomposition; source separation; Bayesian nonparametric latent source discovery; Bayesian nonparametric spectrogram; audio signal analysis; infinite factorial infinite hidden Markov model; music signal analysis; music transcription; nonnegative matrix factorization; note-level manipulation; source separation; Bayesian methods; Conferences; Hidden Markov models; Instruments; Multiple signal classification; Signal analysis; Spectrogram; Collapsed variational Bayes (CVB); Gamma process; Hierarchical Dirichlet process (HDP); Infinite hidden Markov model (iHMM); Nonnegative matrix factorization (NMF);
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Applications of Signal Processing to Audio and Acoustics (WASPAA), 2011 IEEE Workshop on
Conference_Location :
New Paltz, NY
ISSN :
1931-1168
Print_ISBN :
978-1-4577-0692-9
Electronic_ISBN :
1931-1168
Type :
conf
DOI :
10.1109/ASPAA.2011.6082324
Filename :
6082324
Link To Document :
بازگشت