DocumentCode
2357882
Title
Bayesian nonparametric spectrogram modeling based on infinite factorial infinite hidden Markov model
Author
Nakano, Masahiro ; Le Roux, Jonathan ; Kameoka, Hirokazu ; Nakamura, Tomohiko ; Ono, Nobutaka ; Sagayama, Shigeki
Author_Institution
Grad. Sch. of Inf. Sci. & Technol., Univ. of Tokyo, Tokyo, Japan
fYear
2011
fDate
16-19 Oct. 2011
Firstpage
325
Lastpage
328
Abstract
This paper presents a Bayesian nonparametric latent source discovery method for music signal analysis. In audio signal analysis, an important goal is to decompose music signals into individual notes, with applications such as music transcription, source separation or note-level manipulation. Recently, the use of latent variable decompositions, especially nonnegative matrix factorization (NMF), has been a very active area of research. These methods are facing two, mutually dependent, problems: first, instrument sounds often exhibit time-varying spectra, and grasping this time-varying nature is an important factor to characterize the diversity of each instrument; moreover, in many cases we do not know in advance the number of sources and which instruments are played. Conventional decompositions generally fail to cope with these issues as they suffer from the difficulties of automatically determining the number of sources and automatically grouping spectra into single events. We address both these problems by developing a Bayesian nonparametric fusion of NMF and hidden Markov model (HMM). Our model decomposes music spectrograms in an automatically estimated number of components, each of which consisting in an HMM whose number of states is also automatically estimated from the data.
Keywords
Bayes methods; audio signal processing; hidden Markov models; matrix decomposition; source separation; Bayesian nonparametric latent source discovery; Bayesian nonparametric spectrogram; audio signal analysis; infinite factorial infinite hidden Markov model; music signal analysis; music transcription; nonnegative matrix factorization; note-level manipulation; source separation; Bayesian methods; Conferences; Hidden Markov models; Instruments; Multiple signal classification; Signal analysis; Spectrogram; Collapsed variational Bayes (CVB); Gamma process; Hierarchical Dirichlet process (HDP); Infinite hidden Markov model (iHMM); Nonnegative matrix factorization (NMF);
fLanguage
English
Publisher
ieee
Conference_Titel
Applications of Signal Processing to Audio and Acoustics (WASPAA), 2011 IEEE Workshop on
Conference_Location
New Paltz, NY
ISSN
1931-1168
Print_ISBN
978-1-4577-0692-9
Electronic_ISBN
1931-1168
Type
conf
DOI
10.1109/ASPAA.2011.6082324
Filename
6082324
Link To Document